Dataset statistics
| Number of variables | 32 |
|---|---|
| Number of observations | 112372 |
| Missing cells | 173691 |
| Missing cells (%) | 4.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 32.3 MiB |
| Average record size in memory | 301.6 B |
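The missing-cell percentage can be checked from the other figures in this table. A minimal sketch, assuming every observation contributes one cell per variable; the variable names are illustrative only:

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 32
n_observations = 112372
missing_cells = 173691

# Assumption: total cells = observations x variables.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```

The result rounds to the 4.8% reported above.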
Variable types
| Categorical | 21 |
|---|---|
| Numeric | 11 |
Alerts
customer_id has a high cardinality: 97917 distinct values | High cardinality |
customer_unique_id has a high cardinality: 94721 distinct values | High cardinality |
customer_city has a high cardinality: 4108 distinct values | High cardinality |
order_id has a high cardinality: 97917 distinct values | High cardinality |
order_purchase_timestamp has a high cardinality: 97371 distinct values | High cardinality |
order_approved_at has a high cardinality: 89534 distinct values | High cardinality |
order_delivered_carrier_date has a high cardinality: 80450 distinct values | High cardinality |
order_delivered_customer_date has a high cardinality: 95022 distinct values | High cardinality |
order_estimated_delivery_date has a high cardinality: 450 distinct values | High cardinality |
product_id has a high cardinality: 32789 distinct values | High cardinality |
seller_id has a high cardinality: 3090 distinct values | High cardinality |
shipping_limit_date has a high cardinality: 92643 distinct values | High cardinality |
review_id has a high cardinality: 97709 distinct values | High cardinality |
review_comment_title has a high cardinality: 4497 distinct values | High cardinality |
review_comment_message has a high cardinality: 35692 distinct values | High cardinality |
review_creation_date has a high cardinality: 633 distinct values | High cardinality |
review_answer_timestamp has a high cardinality: 97547 distinct values | High cardinality |
product_category_name has a high cardinality: 73 distinct values | High cardinality |
customer_zip_code_prefix is highly overall correlated with customer_state | High correlation |
price is highly overall correlated with product_weight_g | High correlation |
product_weight_g is highly overall correlated with price and 3 other fields | High correlation |
product_length_cm is highly overall correlated with product_weight_g and 1 other field | High correlation |
product_height_cm is highly overall correlated with product_weight_g | High correlation |
product_width_cm is highly overall correlated with product_weight_g and 1 other field | High correlation |
customer_state is highly overall correlated with customer_zip_code_prefix | High correlation |
order_status is highly imbalanced (93.4%) | Imbalance |
order_delivered_carrier_date has 1184 (1.1%) missing values | Missing |
order_delivered_customer_date has 2360 (2.1%) missing values | Missing |
review_comment_title has 98938 (88.0%) missing values | Missing |
review_comment_message has 64730 (57.6%) missing values | Missing |
product_category_name has 1598 (1.4%) missing values | Missing |
product_name_lenght has 1598 (1.4%) missing values | Missing |
product_description_lenght has 1598 (1.4%) missing values | Missing |
product_photos_qty has 1598 (1.4%) missing values | Missing |
customer_id is uniformly distributed | Uniform |
customer_unique_id is uniformly distributed | Uniform |
order_id is uniformly distributed | Uniform |
order_purchase_timestamp is uniformly distributed | Uniform |
order_approved_at is uniformly distributed | Uniform |
order_delivered_carrier_date is uniformly distributed | Uniform |
order_delivered_customer_date is uniformly distributed | Uniform |
shipping_limit_date is uniformly distributed | Uniform |
review_id is uniformly distributed | Uniform |
review_answer_timestamp is uniformly distributed | Uniform |
Reproduction
| Analysis started | 2023-02-09 08:10:26.570667 |
|---|---|
| Analysis finished | 2023-02-09 08:11:05.348272 |
| Duration | 38.78 seconds |
| Software version | pandas-profiling v3.6.2 |
| Download configuration | config.json |
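The reported duration follows from the two timestamps. A small sketch using only the standard library; the format string is an assumption based on how the timestamps are printed above:

```python
from datetime import datetime

# Assumed layout of the timestamps in the Reproduction table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-02-09 08:10:26.570667", FMT)
finished = datetime.strptime("2023-02-09 08:11:05.348272", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```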
customer_id
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 97917 |
|---|---|
| Distinct (%) | 87.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| be1c4e52bb71e0c54b11a26b8e8d59f2 | 22 |
|---|---|
| fc3d1daec319d62d49bfb5e1f83123e9 | 21 |
| be1b70680b9f9694d8c70f41fa3dc92b | 20 |
| adb32467ecc74b53576d9d13a5a55891 | 15 |
| 10de381f8a8d23fff822753305f71cae | 15 |
| Other values (97912) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3595904 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 87758 |
|---|---|
| Unique (%) | 78.1% |
Sample
| 1st row | 06b8999e2fba1a1fbc88172c00ba8bc7 |
|---|---|
| 2nd row | 8912fc0c3bbf1e2fbf35819e21706718 |
| 3rd row | 8912fc0c3bbf1e2fbf35819e21706718 |
| 4th row | f0ac8e5a239118859b1734e1087cbb1f |
| 5th row | 6bc8d08963a135220ed6c6d098831f84 |
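The report gives separate Distinct and Unique figures for this column. A minimal sketch of the difference, assuming Distinct counts different values and Unique counts values that occur exactly once, applied to the five sample rows above:

```python
from collections import Counter

# The five sample rows shown above for customer_id.
sample = [
    "06b8999e2fba1a1fbc88172c00ba8bc7",
    "8912fc0c3bbf1e2fbf35819e21706718",
    "8912fc0c3bbf1e2fbf35819e21706718",
    "f0ac8e5a239118859b1734e1087cbb1f",
    "6bc8d08963a135220ed6c6d098831f84",
]

counts = Counter(sample)

# Distinct: how many different values appear at all.
n_distinct = len(counts)
# Unique (assumed definition): values that appear exactly once.
n_unique = sum(1 for c in counts.values() if c == 1)

print(n_distinct, n_unique)
```

Under that reading, a repeated value still counts toward Distinct but not toward Unique, which would explain why Unique (78.1%) is lower than Distinct (87.1%).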
Common Values
| Value | Count | Frequency (%) |
| be1c4e52bb71e0c54b11a26b8e8d59f2 | 22 | < 0.1% |
| fc3d1daec319d62d49bfb5e1f83123e9 | 21 | < 0.1% |
| be1b70680b9f9694d8c70f41fa3dc92b | 20 | < 0.1% |
| adb32467ecc74b53576d9d13a5a55891 | 15 | < 0.1% |
| 10de381f8a8d23fff822753305f71cae | 15 | < 0.1% |
| a7693fba2ff9583c78751f2b66ecab9d | 14 | < 0.1% |
| d5f2b3f597c7ccafbb5cac0bcc3d6024 | 14 | < 0.1% |
| 7d321bd4e8ba1caf74c4c1aabd9ae524 | 13 | < 0.1% |
| daf15f1b940cc6a72ba558f093dc00dd | 12 | < 0.1% |
| 91f92cfee46b79581b05aa974dd57ce5 | 12 | < 0.1% |
| Other values (97907) | 112214 |
Words
| Value | Count | Frequency (%) |
| be1c4e52bb71e0c54b11a26b8e8d59f2 | 22 | < 0.1% |
| fc3d1daec319d62d49bfb5e1f83123e9 | 21 | < 0.1% |
| be1b70680b9f9694d8c70f41fa3dc92b | 20 | < 0.1% |
| adb32467ecc74b53576d9d13a5a55891 | 15 | < 0.1% |
| 10de381f8a8d23fff822753305f71cae | 15 | < 0.1% |
| a7693fba2ff9583c78751f2b66ecab9d | 14 | < 0.1% |
| d5f2b3f597c7ccafbb5cac0bcc3d6024 | 14 | < 0.1% |
| 7d321bd4e8ba1caf74c4c1aabd9ae524 | 13 | < 0.1% |
| 9eb3d566e87289dcb0acf28e1407c839 | 12 | < 0.1% |
| 0d93f21f3e8543a9d0d8ece01561f5b2 | 12 | < 0.1% |
| Other values (97907) | 112214 |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 225399 | 6.3% |
| c | 225289 | 6.3% |
| 5 | 225194 | 6.3% |
| 1 | 225081 | 6.3% |
| 6 | 224917 | 6.3% |
| 8 | 224874 | 6.3% |
| 2 | 224823 | 6.3% |
| 7 | 224778 | 6.3% |
| a | 224719 | 6.2% |
| b | 224709 | 6.2% |
| Other values (6) | 1346121 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2246846 | |
| Lowercase Letter | 1349058 |
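The category names above match Unicode general categories. A short sketch, assuming that is what the report uses, classifies the characters of the first sample `customer_id` with the standard `unicodedata` module and checks that the two reported category counts add up to 32 characters per row:

```python
import unicodedata
from collections import Counter

# First sample row of customer_id.
customer_id = "06b8999e2fba1a1fbc88172c00ba8bc7"

# "Nd" is Decimal Number, "Ll" is Lowercase Letter.
categories = Counter(unicodedata.category(ch) for ch in customer_id)
print(dict(categories))

# Reported counts for the whole column: Decimal Number + Lowercase Letter.
reported_total = 2246846 + 1349058
print(reported_total == 32 * 112372)
```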
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 225194 | |
| 1 | 225081 | |
| 6 | 224917 | |
| 8 | 224874 | |
| 2 | 224823 | |
| 7 | 224778 | |
| 9 | 224677 | |
| 3 | 224639 | |
| 4 | 223963 | |
| 0 | 223900 |
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 225399 | |
| c | 225289 | |
| a | 224719 | |
| b | 224709 | |
| e | 224522 | |
| d | 224420 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2246846 | |
| Latin | 1349058 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 225194 | |
| 1 | 225081 | |
| 6 | 224917 | |
| 8 | 224874 | |
| 2 | 224823 | |
| 7 | 224778 | |
| 9 | 224677 | |
| 3 | 224639 | |
| 4 | 223963 | |
| 0 | 223900 |
Latin
| Value | Count | Frequency (%) |
| f | 225399 | |
| c | 225289 | |
| a | 224719 | |
| b | 224709 | |
| e | 224522 | |
| d | 224420 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3595904 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 225399 | 6.3% |
| c | 225289 | 6.3% |
| 5 | 225194 | 6.3% |
| 1 | 225081 | 6.3% |
| 6 | 224917 | 6.3% |
| 8 | 224874 | 6.3% |
| 2 | 224823 | 6.3% |
| 7 | 224778 | 6.3% |
| a | 224719 | 6.2% |
| b | 224709 | 6.2% |
| Other values (6) | 1346121 |
customer_unique_id
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 94721 |
|---|---|
| Distinct (%) | 84.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| d97b3cfb22b0d6b25ac9ed4e9c2d481b | 24 |
|---|---|
| c8460e4251689ba205045f3ea17884a1 | 24 |
| 4546caea018ad8c692964e3382debd19 | 21 |
| c402f431464c72e27330a67f7b94d4fb | 20 |
| 0f5ac8d5c31de21d2f25e24be15bbffb | 18 |
| Other values (94716) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3595904 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 82917 |
|---|---|
| Unique (%) | 73.8% |
Sample
| 1st row | 861eff4711a542e4b93843c6dd7febb0 |
|---|---|
| 2nd row | 9eae34bbd3a474ec5d07949ca7de67c0 |
| 3rd row | 9eae34bbd3a474ec5d07949ca7de67c0 |
| 4th row | 3c799d181c34d51f6d44bbbc563024db |
| 5th row | 23397e992b09769faf5e66f9e171a241 |
Common Values
| Value | Count | Frequency (%) |
| d97b3cfb22b0d6b25ac9ed4e9c2d481b | 24 | < 0.1% |
| c8460e4251689ba205045f3ea17884a1 | 24 | < 0.1% |
| 4546caea018ad8c692964e3382debd19 | 21 | < 0.1% |
| c402f431464c72e27330a67f7b94d4fb | 20 | < 0.1% |
| 0f5ac8d5c31de21d2f25e24be15bbffb | 18 | < 0.1% |
| 8d50f5eadf50201ccdcedfb9e2ac8455 | 16 | < 0.1% |
| 11f97da02237a49c8e783dfda6f50e8e | 15 | < 0.1% |
| eae0a83d752b1dd32697e0e7b4221656 | 15 | < 0.1% |
| 33176de67c05eeed870fd49f234387a0 | 15 | < 0.1% |
| 31e412b9fb766b6794724ed17a41dfa6 | 14 | < 0.1% |
| Other values (94711) | 112190 |
Words
| Value | Count | Frequency (%) |
| d97b3cfb22b0d6b25ac9ed4e9c2d481b | 24 | < 0.1% |
| c8460e4251689ba205045f3ea17884a1 | 24 | < 0.1% |
| 4546caea018ad8c692964e3382debd19 | 21 | < 0.1% |
| c402f431464c72e27330a67f7b94d4fb | 20 | < 0.1% |
| 0f5ac8d5c31de21d2f25e24be15bbffb | 18 | < 0.1% |
| 8d50f5eadf50201ccdcedfb9e2ac8455 | 16 | < 0.1% |
| 11f97da02237a49c8e783dfda6f50e8e | 15 | < 0.1% |
| eae0a83d752b1dd32697e0e7b4221656 | 15 | < 0.1% |
| 33176de67c05eeed870fd49f234387a0 | 15 | < 0.1% |
| dee6a650840de087ac42c4367bc9baf3 | 14 | < 0.1% |
| Other values (94711) | 112190 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 225520 | 6.3% |
| 1 | 225456 | 6.3% |
| e | 225081 | 6.3% |
| 8 | 224933 | 6.3% |
| b | 224899 | 6.3% |
| d | 224862 | 6.3% |
| a | 224854 | 6.3% |
| 5 | 224792 | 6.3% |
| 9 | 224775 | 6.3% |
| 2 | 224680 | 6.2% |
| Other values (6) | 1346052 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2247942 | |
| Lowercase Letter | 1347962 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 225520 | |
| 1 | 225456 | |
| 8 | 224933 | |
| 5 | 224792 | |
| 9 | 224775 | |
| 2 | 224680 | |
| 3 | 224650 | |
| 0 | 224625 | |
| 7 | 224466 | |
| 4 | 224045 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 225081 | |
| b | 224899 | |
| d | 224862 | |
| a | 224854 | |
| f | 224279 | |
| c | 223987 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2247942 | |
| Latin | 1347962 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 225520 | |
| 1 | 225456 | |
| 8 | 224933 | |
| 5 | 224792 | |
| 9 | 224775 | |
| 2 | 224680 | |
| 3 | 224650 | |
| 0 | 224625 | |
| 7 | 224466 | |
| 4 | 224045 |
Latin
| Value | Count | Frequency (%) |
| e | 225081 | |
| b | 224899 | |
| d | 224862 | |
| a | 224854 | |
| f | 224279 | |
| c | 223987 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3595904 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 225520 | 6.3% |
| 1 | 225456 | 6.3% |
| e | 225081 | 6.3% |
| 8 | 224933 | 6.3% |
| b | 224899 | 6.3% |
| d | 224862 | 6.3% |
| a | 224854 | 6.3% |
| 5 | 224792 | 6.3% |
| 9 | 224775 | 6.3% |
| 2 | 224680 | 6.2% |
| Other values (6) | 1346052 |
customer_zip_code_prefix
Real number (ℝ)
| Distinct | 14955 |
|---|---|
| Distinct (%) | 13.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35131.881 |
| Minimum | 1003 |
|---|---|
| Maximum | 99990 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 1003 |
|---|---|
| 5-th percentile | 3308 |
| Q1 | 11250 |
| median | 24320 |
| Q3 | 59063 |
| 95-th percentile | 90630 |
| Maximum | 99990 |
| Range | 98987 |
| Interquartile range (IQR) | 47813 |
Descriptive statistics
| Standard deviation | 29894.588 |
|---|---|
| Coefficient of variation (CV) | 0.85092477 |
| Kurtosis | -0.79564531 |
| Mean | 35131.881 |
| Median Absolute Deviation (MAD) | 16406 |
| Skewness | 0.77954883 |
| Sum | 3.9478397 × 10^9 |
| Variance | 8.9368637 × 10^8 |
| Monotonicity | Not monotonic |
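Several of the figures above are derived from one another. A minimal sketch that recomputes them from the reported mean, standard deviation, quartiles and extremes; small differences are expected because the inputs are already rounded:

```python
# Values copied from the tables above for customer_zip_code_prefix.
n = 112372
mean, std = 35131.881, 29894.588
q1, q3 = 11250, 59063
minimum, maximum = 1003, 99990

iqr = q3 - q1                  # interquartile range
value_range = maximum - minimum
cv = std / mean                # coefficient of variation
variance = std ** 2
total = mean * n               # sum of all values

print(iqr, value_range)
print(f"CV={cv:.6f} variance={variance:.4e} sum={total:.4e}")
```

These statistics treat the zip code prefix as a number, which is how the report typed it; the prefix is really a geographic code, so the mean and variance carry little meaning.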
| Value | Count | Frequency (%) |
| 22790 | 151 | 0.1% |
| 22793 | 150 | 0.1% |
| 24220 | 145 | 0.1% |
| 24230 | 137 | 0.1% |
| 22775 | 124 | 0.1% |
| 35162 | 112 | 0.1% |
| 29101 | 106 | 0.1% |
| 11740 | 106 | 0.1% |
| 13212 | 105 | 0.1% |
| 13087 | 104 | 0.1% |
| Other values (14945) | 111132 |
| Value | Count | Frequency (%) |
| 1003 | 1 | < 0.1% |
| 1004 | 2 | < 0.1% |
| 1005 | 6 | < 0.1% |
| 1006 | 2 | < 0.1% |
| 1007 | 4 | < 0.1% |
| 1008 | 3 | < 0.1% |
| 1009 | 8 | < 0.1% |
| 1011 | 6 | < 0.1% |
| 1012 | 2 | < 0.1% |
| 1013 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 99990 | 1 | < 0.1% |
| 99980 | 3 | < 0.1% |
| 99970 | 1 | < 0.1% |
| 99965 | 2 | < 0.1% |
| 99960 | 1 | < 0.1% |
| 99955 | 3 | < 0.1% |
| 99950 | 9 | < 0.1% |
| 99940 | 2 | < 0.1% |
| 99930 | 5 | < 0.1% |
| 99925 | 1 | < 0.1% |
customer_city
Categorical
| Distinct | 4108 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| sao paulo | 17794 |
|---|---|
| rio de janeiro | 7786 |
| belo horizonte | 3150 |
| brasilia | 2402 |
| curitiba | 1749 |
| Other values (4103) |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 10.338643 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1161774 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1063 |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | franca |
|---|---|
| 2nd row | santarem |
| 3rd row | santarem |
| 4th row | nova santa rita |
| 5th row | mage |
Common Values
| Value | Count | Frequency (%) |
| sao paulo | 17794 | 15.8% |
| rio de janeiro | 7786 | 6.9% |
| belo horizonte | 3150 | 2.8% |
| brasilia | 2402 | 2.1% |
| curitiba | 1749 | 1.6% |
| campinas | 1642 | 1.5% |
| porto alegre | 1616 | 1.4% |
| salvador | 1393 | 1.2% |
| guarulhos | 1314 | 1.2% |
| sao bernardo do campo | 1067 | 0.9% |
| Other values (4098) | 72459 |
Words
| Value | Count | Frequency (%) |
| sao | 23948 | 12.2% |
| paulo | 17873 | 9.1% |
| de | 10911 | 5.5% |
| rio | 9372 | 4.8% |
| janeiro | 7786 | 4.0% |
| do | 4858 | 2.5% |
| belo | 3220 | 1.6% |
| horizonte | 3178 | 1.6% |
| brasilia | 2412 | 1.2% |
| porto | 1918 | 1.0% |
| Other values (3279) | 111504 |
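The table above appears to count individual words rather than whole city names ("sao" occurs more often than "sao paulo"). A minimal sketch of that kind of count, assuming whitespace tokenization and lowercasing; the short `cities` list is illustrative, not the full column:

```python
from collections import Counter

# A few city names taken from the Common Values table (illustrative subset).
cities = [
    "sao paulo",
    "rio de janeiro",
    "sao bernardo do campo",
    "belo horizonte",
]

# Assumption: words are lowercased and split on whitespace.
words = Counter(word for city in cities for word in city.lower().split())

print(words["sao"], words["de"], sum(words.values()))
```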
Most occurring characters
| Value | Count | Frequency (%) |
| a | 191445 | |
| o | 143516 | |
| i | 88922 | 7.7% |
| r | 86281 | 7.4% |
| (space) | 84608 | 7.3% |
| e | 75418 | 6.5% |
| s | 71066 | 6.1% |
| n | 51541 | 4.4% |
| u | 50894 | 4.4% |
| l | 50710 | 4.4% |
| Other values (21) | 267373 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1076645 | |
| Space Separator | 84608 | 7.3% |
| Dash Punctuation | 269 | < 0.1% |
| Other Punctuation | 250 | < 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 191445 | |
| o | 143516 | |
| i | 88922 | 8.3% |
| r | 86281 | 8.0% |
| e | 75418 | 7.0% |
| s | 71066 | 6.6% |
| n | 51541 | 4.8% |
| u | 50894 | 4.7% |
| l | 50710 | 4.7% |
| p | 42279 | 3.9% |
| Other values (16) | 224573 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 4 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 84608 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 269 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 250 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1076645 | |
| Common | 85129 | 7.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 191445 | |
| o | 143516 | |
| i | 88922 | 8.3% |
| r | 86281 | 8.0% |
| e | 75418 | 7.0% |
| s | 71066 | 6.6% |
| n | 51541 | 4.8% |
| u | 50894 | 4.7% |
| l | 50710 | 4.7% |
| p | 42279 | 3.9% |
| Other values (16) | 224573 |
Common
| Value | Count | Frequency (%) |
| (space) | 84608 | |
| - | 269 | 0.3% |
| ' | 250 | 0.3% |
| 1 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1161774 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 191445 | |
| o | 143516 | |
| i | 88922 | 7.7% |
| r | 86281 | 7.4% |
| (space) | 84608 | 7.3% |
| e | 75418 | 6.5% |
| s | 71066 | 6.1% |
| n | 51541 | 4.4% |
| u | 50894 | 4.4% |
| l | 50710 | 4.4% |
| Other values (21) | 267373 |
customer_state
Categorical
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| SP | 47399 |
|---|---|
| RJ | 14468 |
| MG | 13110 |
| RS | 6265 |
| PR | 5736 |
| Other values (22) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 224744 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SP |
|---|---|
| 2nd row | PA |
| 3rd row | PA |
| 4th row | RS |
| 5th row | RJ |
Common Values
| Value | Count | Frequency (%) |
| SP | 47399 | 42.2% |
| RJ | 14468 | 12.9% |
| MG | 13110 | 11.7% |
| RS | 6265 | 5.6% |
| PR | 5736 | 5.1% |
| SC | 4156 | 3.7% |
| BA | 3766 | 3.4% |
| DF | 2416 | 2.2% |
| GO | 2316 | 2.1% |
| ES | 2237 | 2.0% |
| Other values (17) | 10503 | 9.3% |
Words
| Value | Count | Frequency (%) |
| sp | 47399 | 42.2% |
| rj | 14468 | 12.9% |
| mg | 13110 | 11.7% |
| rs | 6265 | 5.6% |
| pr | 5736 | 5.1% |
| sc | 4156 | 3.7% |
| ba | 3766 | 3.4% |
| df | 2416 | 2.2% |
| go | 2316 | 2.1% |
| es | 2237 | 2.0% |
| Other values (17) | 10503 | 9.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 61269 | |
| P | 57213 | |
| R | 27377 | |
| M | 15973 | 7.1% |
| G | 15426 | 6.9% |
| J | 14468 | 6.4% |
| A | 6438 | 2.9% |
| E | 5888 | 2.6% |
| C | 5720 | 2.5% |
| B | 4362 | 1.9% |
| Other values (7) | 10610 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 224744 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 61269 | |
| P | 57213 | |
| R | 27377 | |
| M | 15973 | 7.1% |
| G | 15426 | 6.9% |
| J | 14468 | 6.4% |
| A | 6438 | 2.9% |
| E | 5888 | 2.6% |
| C | 5720 | 2.5% |
| B | 4362 | 1.9% |
| Other values (7) | 10610 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 224744 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 61269 | |
| P | 57213 | |
| R | 27377 | |
| M | 15973 | 7.1% |
| G | 15426 | 6.9% |
| J | 14468 | 6.4% |
| A | 6438 | 2.9% |
| E | 5888 | 2.6% |
| C | 5720 | 2.5% |
| B | 4362 | 1.9% |
| Other values (7) | 10610 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 224744 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 61269 | |
| P | 57213 | |
| R | 27377 | |
| M | 15973 | 7.1% |
| G | 15426 | 6.9% |
| J | 14468 | 6.4% |
| A | 6438 | 2.9% |
| E | 5888 | 2.6% |
| C | 5720 | 2.5% |
| B | 4362 | 1.9% |
| Other values (7) | 10610 | 4.7% |
order_id
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 97917 |
|---|---|
| Distinct (%) | 87.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| 5a3b1c29a49756e75f1ef513383c0c12 | 22 |
|---|---|
| 8272b63d03f5f79c56e9e4120aec44ef | 21 |
| 1b15974a0141d54e36626dca3fdc731a | 20 |
| 9ef13efd6949e4573a18964dd1bbe7f5 | 15 |
| 428a2f660dc84138d969ccd69a0ab6d5 | 15 |
| Other values (97912) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3595904 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 87758 |
|---|---|
| Unique (%) | 78.1% |
Sample
| 1st row | 00e7ee1b050b8499577073aeb2a297a1 |
|---|---|
| 2nd row | c1d2b34febe9cd269e378117d6681172 |
| 3rd row | c1d2b34febe9cd269e378117d6681172 |
| 4th row | b1a5d5365d330d10485e0203d54ab9e8 |
| 5th row | 2e604b3614664aa66867856dba7e61b7 |
Common Values
| Value | Count | Frequency (%) |
| 5a3b1c29a49756e75f1ef513383c0c12 | 22 | < 0.1% |
| 8272b63d03f5f79c56e9e4120aec44ef | 21 | < 0.1% |
| 1b15974a0141d54e36626dca3fdc731a | 20 | < 0.1% |
| 9ef13efd6949e4573a18964dd1bbe7f5 | 15 | < 0.1% |
| 428a2f660dc84138d969ccd69a0ab6d5 | 15 | < 0.1% |
| 9bdc4d4c71aa1de4606060929dee888c | 14 | < 0.1% |
| 73c8ab38f07dc94389065f7eba4f297a | 14 | < 0.1% |
| 37ee401157a3a0b28c9c6d0ed8c3b24b | 13 | < 0.1% |
| 637617b3ffe9e2f7a2411243829226d0 | 12 | < 0.1% |
| 3a213fcdfe7d98be74ea0dc05a8b31ae | 12 | < 0.1% |
| Other values (97907) | 112214 |
Words
| Value | Count | Frequency (%) |
| 5a3b1c29a49756e75f1ef513383c0c12 | 22 | < 0.1% |
| 8272b63d03f5f79c56e9e4120aec44ef | 21 | < 0.1% |
| 1b15974a0141d54e36626dca3fdc731a | 20 | < 0.1% |
| 9ef13efd6949e4573a18964dd1bbe7f5 | 15 | < 0.1% |
| 428a2f660dc84138d969ccd69a0ab6d5 | 15 | < 0.1% |
| 9bdc4d4c71aa1de4606060929dee888c | 14 | < 0.1% |
| 73c8ab38f07dc94389065f7eba4f297a | 14 | < 0.1% |
| 37ee401157a3a0b28c9c6d0ed8c3b24b | 13 | < 0.1% |
| af822dacd6f5cff7376413c03a388bb7 | 12 | < 0.1% |
| 2c2a19b5703863c908512d135aa6accc | 12 | < 0.1% |
| Other values (97907) | 112214 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 225874 | 6.3% |
| 6 | 225416 | 6.3% |
| e | 225362 | 6.3% |
| b | 225319 | 6.3% |
| 7 | 225255 | 6.3% |
| 3 | 225141 | 6.3% |
| a | 224944 | 6.3% |
| 2 | 224842 | 6.3% |
| 8 | 224819 | 6.3% |
| 1 | 224818 | 6.3% |
| Other values (6) | 1344114 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2247355 | |
| Lowercase Letter | 1348549 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 225874 | |
| 6 | 225416 | |
| 7 | 225255 | |
| 3 | 225141 | |
| 2 | 224842 | |
| 8 | 224819 | |
| 1 | 224818 | |
| 9 | 224121 | |
| 0 | 223894 | |
| 5 | 223175 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 225362 | |
| b | 225319 | |
| a | 224944 | |
| c | 224756 | |
| f | 224517 | |
| d | 223651 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2247355 | |
| Latin | 1348549 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 225874 | |
| 6 | 225416 | |
| 7 | 225255 | |
| 3 | 225141 | |
| 2 | 224842 | |
| 8 | 224819 | |
| 1 | 224818 | |
| 9 | 224121 | |
| 0 | 223894 | |
| 5 | 223175 |
Latin
| Value | Count | Frequency (%) |
| e | 225362 | |
| b | 225319 | |
| a | 224944 | |
| c | 224756 | |
| f | 224517 | |
| d | 223651 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3595904 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 225874 | 6.3% |
| 6 | 225416 | 6.3% |
| e | 225362 | 6.3% |
| b | 225319 | 6.3% |
| 7 | 225255 | 6.3% |
| 3 | 225141 | 6.3% |
| a | 224944 | 6.3% |
| 2 | 224842 | 6.3% |
| 8 | 224819 | 6.3% |
| 1 | 224818 | 6.3% |
| Other values (6) | 1344114 |
order_status
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| delivered | 110013 |
|---|---|
| shipped | 1110 |
| canceled | 529 |
| invoiced | 358 |
| processing | 352 |
| Other values (2) | 10 |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 8.9755811 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1008604 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | delivered |
|---|---|
| 2nd row | delivered |
| 3rd row | delivered |
| 4th row | delivered |
| 5th row | delivered |
Common Values
| Value | Count | Frequency (%) |
| delivered | 110013 | 97.9% |
| shipped | 1110 | 1.0% |
| canceled | 529 | 0.5% |
| invoiced | 358 | 0.3% |
| processing | 352 | 0.3% |
| unavailable | 7 | < 0.1% |
| approved | 3 | < 0.1% |
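The report flags order_status as highly imbalanced (93.4%) without stating how the score is defined. One candidate, assumed here rather than taken from the report, is one minus the Shannon entropy of the class frequencies divided by the maximum possible entropy for that number of classes. With the counts above it gives the same figure:

```python
import math

# Counts from the Common Values table for order_status.
counts = [110013, 1110, 529, 358, 352, 7, 3]
total = sum(counts)

# Shannon entropy (base 2) of the observed class frequencies.
entropy = -sum((c / total) * math.log2(c / total) for c in counts)

# Assumed score: 0 for a perfectly balanced column, 1 for a single class.
imbalance = 1 - entropy / math.log2(len(counts))

print(f"{100 * imbalance:.1f}%")
```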
Words
| Value | Count | Frequency (%) |
| delivered | 110013 | 97.9% |
| shipped | 1110 | 1.0% |
| canceled | 529 | 0.5% |
| invoiced | 358 | 0.3% |
| processing | 352 | 0.3% |
| unavailable | 7 | < 0.1% |
| approved | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 332927 | |
| d | 222026 | |
| i | 112198 | 11.1% |
| l | 110556 | 11.0% |
| v | 110381 | 10.9% |
| r | 110368 | 10.9% |
| p | 2578 | 0.3% |
| s | 1814 | 0.2% |
| c | 1768 | 0.2% |
| n | 1246 | 0.1% |
| Other values (6) | 2742 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1008604 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 332927 | |
| d | 222026 | |
| i | 112198 | 11.1% |
| l | 110556 | 11.0% |
| v | 110381 | 10.9% |
| r | 110368 | 10.9% |
| p | 2578 | 0.3% |
| s | 1814 | 0.2% |
| c | 1768 | 0.2% |
| n | 1246 | 0.1% |
| Other values (6) | 2742 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1008604 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 332927 | |
| d | 222026 | |
| i | 112198 | 11.1% |
| l | 110556 | 11.0% |
| v | 110381 | 10.9% |
| r | 110368 | 10.9% |
| p | 2578 | 0.3% |
| s | 1814 | 0.2% |
| c | 1768 | 0.2% |
| n | 1246 | 0.1% |
| Other values (6) | 2742 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1008604 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 332927 | |
| d | 222026 | |
| i | 112198 | 11.1% |
| l | 110556 | 11.0% |
| v | 110381 | 10.9% |
| r | 110368 | 10.9% |
| p | 2578 | 0.3% |
| s | 1814 | 0.2% |
| c | 1768 | 0.2% |
| n | 1246 | 0.1% |
| Other values (6) | 2742 | 0.3% |
order_purchase_timestamp
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 97371 |
|---|---|
| Distinct (%) | 86.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| 2017-10-17 13:06:29 | 22 |
|---|---|
| 2017-07-16 18:19:25 | 21 |
| 2018-02-22 15:30:41 | 20 |
| 2017-01-30 21:44:49 | 15 |
| 2017-11-23 20:30:52 | 15 |
| Other values (97366) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 2135068 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 86925 |
|---|---|
| Unique (%) | 77.4% |
Sample
| 1st row | 2017-05-16 15:05:35 |
|---|---|
| 2nd row | 2017-11-09 00:50:13 |
| 3rd row | 2017-11-09 00:50:13 |
| 4th row | 2017-05-07 20:11:26 |
| 5th row | 2018-02-03 19:45:40 |
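This column is typed as Categorical, which suggests the timestamps were loaded as plain strings; the fixed length of 19 fits a `YYYY-MM-DD HH:MM:SS` layout. A minimal sketch, assuming that layout, that parses the distinct sample values into `datetime` objects so they can be ordered and subtracted:

```python
from datetime import datetime

# Distinct sample values shown above for order_purchase_timestamp.
samples = [
    "2017-05-16 15:05:35",
    "2017-11-09 00:50:13",
    "2017-05-07 20:11:26",
    "2018-02-03 19:45:40",
]

# Assumed layout, inferred from the samples.
FMT = "%Y-%m-%d %H:%M:%S"
parsed = [datetime.strptime(s, FMT) for s in samples]

earliest, latest = min(parsed), max(parsed)
print(earliest, latest, latest - earliest)
```

Parsing the column before profiling would likely let the report treat it as a date rather than as a high-cardinality category.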
Common Values
| Value | Count | Frequency (%) |
| 2017-10-17 13:06:29 | 22 | < 0.1% |
| 2017-07-16 18:19:25 | 21 | < 0.1% |
| 2018-02-22 15:30:41 | 20 | < 0.1% |
| 2017-01-30 21:44:49 | 15 | < 0.1% |
| 2017-11-23 20:30:52 | 15 | < 0.1% |
| 2017-12-13 14:21:15 | 14 | < 0.1% |
| 2018-02-21 11:45:07 | 14 | < 0.1% |
| 2018-04-12 11:02:51 | 13 | < 0.1% |
| 2017-09-22 17:41:49 | 12 | < 0.1% |
| 2018-01-12 16:19:31 | 12 | < 0.1% |
| Other values (97361) | 112214 |
Words
| Value | Count | Frequency (%) |
| 2017-11-24 | 1366 | 0.6% |
| 2017-11-25 | 580 | 0.3% |
| 2017-11-27 | 480 | 0.2% |
| 2017-11-26 | 452 | 0.2% |
| 2018-08-06 | 430 | 0.2% |
| 2018-08-07 | 429 | 0.2% |
| 2017-11-28 | 428 | 0.2% |
| 2018-05-15 | 420 | 0.2% |
| 2018-05-07 | 417 | 0.2% |
| 2018-05-14 | 412 | 0.2% |
| Other values (51075) | 219330 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 347765 | |
| 0 | 345779 | |
| 2 | 273979 | |
| - | 224744 | |
| : | 224744 | |
| 8 | 117176 | 5.5% |
| (space) | 112372 | 5.3% |
| 7 | 104069 | 4.9% |
| 3 | 99198 | 4.6% |
| 5 | 90833 | 4.3% |
| Other values (3) | 194409 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1573208 | |
| Dash Punctuation | 224744 | 10.5% |
| Other Punctuation | 224744 | 10.5% |
| Space Separator | 112372 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 347765 | |
| 0 | 345779 | |
| 2 | 273979 | |
| 8 | 117176 | 7.4% |
| 7 | 104069 | 6.6% |
| 3 | 99198 | 6.3% |
| 5 | 90833 | 5.8% |
| 4 | 90807 | 5.8% |
| 6 | 53371 | 3.4% |
| 9 | 50231 | 3.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 224744 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 224744 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 112372 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2135068 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 347765 | |
| 0 | 345779 | |
| 2 | 273979 | |
| - | 224744 | |
| : | 224744 | |
| 8 | 117176 | 5.5% |
| (space) | 112372 | 5.3% |
| 7 | 104069 | 4.9% |
| 3 | 99198 | 4.6% |
| 5 | 90833 | 4.3% |
| Other values (3) | 194409 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2135068 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 347765 | |
| 0 | 345779 | |
| 2 | 273979 | |
| - | 224744 | |
| : | 224744 | |
| 8 | 117176 | 5.5% |
| (space) | 112372 | 5.3% |
| 7 | 104069 | 4.9% |
| 3 | 99198 | 4.6% |
| 5 | 90833 | 4.3% |
| Other values (3) | 194409 |
order_approved_at
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 89534 |
|---|---|
| Distinct (%) | 79.7% |
| Missing | 15 |
| Missing (%) | < 0.1% |
| Memory size | 1.7 MiB |
| 2018-02-24 03:20:27 | 23 |
|---|---|
| 2017-10-18 13:06:21 | 22 |
| 2017-07-17 18:25:23 | 21 |
| 2017-01-30 22:33:45 | 15 |
| 2017-12-15 02:30:41 | 15 |
| Other values (89529) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 2134783 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 74068 |
|---|---|
| Unique (%) | 65.9% |
Sample
| 1st row | 2017-05-16 15:22:12 |
|---|---|
| 2nd row | 2017-11-10 00:47:48 |
| 3rd row | 2017-11-10 00:47:48 |
| 4th row | 2017-05-08 22:22:56 |
| 5th row | 2018-02-04 22:29:19 |
Common Values
| Value | Count | Frequency (%) |
| 2018-02-24 03:20:27 | 23 | < 0.1% |
| 2017-10-18 13:06:21 | 22 | < 0.1% |
| 2017-07-17 18:25:23 | 21 | < 0.1% |
| 2017-01-30 22:33:45 | 15 | < 0.1% |
| 2017-12-15 02:30:41 | 15 | < 0.1% |
| 2018-06-08 19:31:06 | 15 | < 0.1% |
| 2017-11-24 10:31:10 | 15 | < 0.1% |
| 2018-02-22 11:48:42 | 14 | < 0.1% |
| 2018-04-14 02:31:43 | 13 | < 0.1% |
| 2018-04-19 22:11:43 | 13 | < 0.1% |
| Other values (89524) | 112191 | |
| (Missing) | 15 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| 2018-04-24 | 1115 | 0.5% |
| 2017-11-24 | 951 | 0.4% |
| 2017-11-25 | 875 | 0.4% |
| 2018-07-05 | 800 | 0.4% |
| 2017-11-28 | 582 | 0.3% |
| 2018-08-07 | 493 | 0.2% |
| 2018-05-08 | 487 | 0.2% |
| 2017-12-05 | 469 | 0.2% |
| 2018-08-20 | 466 | 0.2% |
| 2018-05-15 | 453 | 0.2% |
| Other values (42032) | 218023 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 360517 | |
| 1 | 345025 | |
| 2 | 273152 | |
| - | 224714 | |
| : | 224714 | |
| (space) | 112357 | 5.3% |
| 8 | 111209 | 5.2% |
| 5 | 108307 | 5.1% |
| 3 | 105504 | 4.9% |
| 7 | 99292 | 4.7% |
| Other values (3) | 169992 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1572998 | |
| Dash Punctuation | 224714 | 10.5% |
| Other Punctuation | 224714 | 10.5% |
| Space Separator | 112357 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 360517 | |
| 1 | 345025 | |
| 2 | 273152 | |
| 8 | 111209 | 7.1% |
| 5 | 108307 | 6.9% |
| 3 | 105504 | 6.7% |
| 7 | 99292 | 6.3% |
| 4 | 78126 | 5.0% |
| 6 | 48467 | 3.1% |
| 9 | 43399 | 2.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 224714 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 224714 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 112357 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2134783 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 360517 | |
| 1 | 345025 | |
| 2 | 273152 | |
| - | 224714 | |
| : | 224714 | |
| (space) | 112357 | 5.3% |
| 8 | 111209 | 5.2% |
| 5 | 108307 | 5.1% |
| 3 | 105504 | 4.9% |
| 7 | 99292 | 4.7% |
| Other values (3) | 169992 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2134783 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 360517 | |
| 1 | 345025 | |
| 2 | 273152 | |
| - | 224714 | |
| : | 224714 | |
| (space) | 112357 | 5.3% |
| 8 | 111209 | 5.2% |
| 5 | 108307 | 5.1% |
| 3 | 105504 | 4.9% |
| 7 | 99292 | 4.7% |
| Other values (3) | 169992 |
order_delivered_carrier_date
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 80450 |
|---|---|
| Distinct (%) | 72.4% |
| Missing | 1184 |
| Missing (%) | 1.1% |
| Memory size | 1.7 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 2112572 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 62941 |
|---|---|
| Unique (%) | 56.6% |
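The character totals reported for this column are consistent with every non-missing value being a 19-character `YYYY-MM-DD HH:MM:SS` string. The sketch below re-derives them from the row counts in the report. The variable names are illustrative, and the fixed layout is an assumption inferred from the sample rows and the constant length of 19, not something the report states.

```python
# Counts copied from the report; the decomposition itself is an inference.
N_OBSERVATIONS = 112372  # rows in the dataset
N_MISSING = 1184         # missing order_delivered_carrier_date values

non_missing = N_OBSERVATIONS - N_MISSING

# One sample value stands in for the assumed fixed "YYYY-MM-DD HH:MM:SS" layout.
template = "2017-05-23 10:47:57"
per_value = {
    "Decimal Number": sum(ch.isdigit() for ch in template),
    "Dash Punctuation": template.count("-"),
    "Other Punctuation": template.count(":"),
    "Space Separator": template.count(" "),
}

# Scale the per-value counts up to the whole column.
totals = {category: count * non_missing for category, count in per_value.items()}
total_characters = len(template) * non_missing

for category, total in totals.items():
    print(f"{category}: {total} ({total / total_characters:.1%})")
print("Total characters:", total_characters)
```

If the derived totals match the report, the column contains no malformed or truncated timestamps, which makes it a reasonable candidate for conversion to a datetime type.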
Sample
| 1st row | 2017-05-23 10:47:57 |
|---|---|
| 2nd row | 2017-11-22 01:43:37 |
| 3rd row | 2017-11-22 01:43:37 |
| 4th row | 2017-05-19 20:16:31 |
| 5th row | 2018-02-19 18:21:47 |
Common Values
| Value | Count | Frequency (%) |
| 2018-05-09 15:48:00 | 48 | < 0.1% |
| 2018-05-10 18:29:00 | 36 | < 0.1% |
| 2017-10-20 19:09:07 | 22 | < 0.1% |
| 2017-07-20 15:45:53 | 21 | < 0.1% |
| 2018-08-08 15:01:00 | 21 | < 0.1% |
| 2018-05-07 12:31:00 | 21 | < 0.1% |
| 2018-03-02 00:18:01 | 20 | < 0.1% |
| 2018-06-08 14:40:00 | 19 | < 0.1% |
| 2018-05-10 14:28:00 | 18 | < 0.1% |
| 2018-08-15 12:53:00 | 18 | < 0.1% |
| Other values (80440) | 110944 | |
| (Missing) | 1184 | 1.1% |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| 2017-11-28 | 878 | 0.4% |
| 2017-11-27 | 762 | 0.3% |
| 2017-11-29 | 675 | 0.3% |
| 2018-02-27 | 612 | 0.3% |
| 2018-03-27 | 588 | 0.3% |
| 2018-08-06 | 579 | 0.3% |
| 2017-11-30 | 551 | 0.2% |
| 2018-08-13 | 531 | 0.2% |
| 2018-05-14 | 530 | 0.2% |
| 2018-05-08 | 515 | 0.2% |
| Other values (37386) | 216155 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 385503 | |
| 1 | 329214 | |
| 2 | 262019 | |
| - | 222376 | |
| : | 222376 | |
| 8 | 117650 | 5.6% |
| (space) | 111188 | 5.3% |
| 7 | 100960 | 4.8% |
| 3 | 93472 | 4.4% |
| 4 | 87465 | 4.1% |
| Other values (3) | 180349 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1556632 | |
| Dash Punctuation | 222376 | 10.5% |
| Other Punctuation | 222376 | 10.5% |
| Space Separator | 111188 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 385503 | |
| 1 | 329214 | |
| 2 | 262019 | |
| 8 | 117650 | 7.6% |
| 7 | 100960 | 6.5% |
| 3 | 93472 | 6.0% |
| 4 | 87465 | 5.6% |
| 5 | 85177 | 5.5% |
| 6 | 48927 | 3.1% |
| 9 | 46245 | 3.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 222376 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 222376 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 111188 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2112572 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 385503 | |
| 1 | 329214 | |
| 2 | 262019 | |
| - | 222376 | |
| : | 222376 | |
| 8 | 117650 | 5.6% |
| (space) | 111188 | 5.3% |
| 7 | 100960 | 4.8% |
| 3 | 93472 | 4.4% |
| 4 | 87465 | 4.1% |
| Other values (3) | 180349 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2112572 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 385503 | |
| 1 | 329214 | |
| 2 | 262019 | |
| - | 222376 | |
| : | 222376 | |
| 8 | 117650 | 5.6% |
| (space) | 111188 | 5.3% |
| 7 | 100960 | 4.8% |
| 3 | 93472 | 4.4% |
| 4 | 87465 | 4.1% |
| Other values (3) | 180349 |
order_delivered_customer_date
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 95022 |
|---|---|
| Distinct (%) | 86.4% |
| Missing | 2360 |
| Missing (%) | 2.1% |
| Memory size | 1.7 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 2090228 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 84401 |
|---|---|
| Unique (%) | 76.7% |
Sample
| 1st row | 2017-05-25 10:35:35 |
|---|---|
| 2nd row | 2017-11-28 00:09:50 |
| 3rd row | 2017-11-28 00:09:50 |
| 4th row | 2017-05-26 09:54:04 |
| 5th row | 2018-02-28 21:09:00 |
Common Values
| Value | Count | Frequency (%) |
| 2017-10-22 14:43:54 | 22 | < 0.1% |
| 2017-07-31 18:03:02 | 21 | < 0.1% |
| 2018-03-05 15:22:27 | 20 | < 0.1% |
| 2017-12-13 20:19:35 | 15 | < 0.1% |
| 2017-02-14 10:48:10 | 15 | < 0.1% |
| 2018-03-01 20:47:01 | 14 | < 0.1% |
| 2017-12-28 09:05:34 | 14 | < 0.1% |
| 2018-04-23 17:47:44 | 13 | < 0.1% |
| 2018-05-15 19:37:06 | 12 | < 0.1% |
| 2018-01-22 19:51:44 | 12 | < 0.1% |
| Other values (95012) | 109854 | |
| (Missing) | 2360 | 2.1% |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| 2018-05-14 | 512 | 0.2% |
| 2018-08-13 | 498 | 0.2% |
| 2018-05-21 | 496 | 0.2% |
| 2018-08-27 | 494 | 0.2% |
| 2018-05-18 | 488 | 0.2% |
| 2017-12-11 | 476 | 0.2% |
| 2018-04-11 | 470 | 0.2% |
| 2018-05-03 | 463 | 0.2% |
| 2017-06-19 | 460 | 0.2% |
| 2018-07-30 | 454 | 0.2% |
| Other values (41602) | 215213 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 323126 | |
| 0 | 320225 | |
| 2 | 277020 | |
| - | 220024 | |
| : | 220024 | |
| 8 | 129261 | |
| (space) | 110012 | 5.3% |
| 3 | 101890 | 4.9% |
| 7 | 101623 | 4.9% |
| 4 | 95107 | 4.6% |
| Other values (3) | 191916 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1540168 | |
| Dash Punctuation | 220024 | 10.5% |
| Other Punctuation | 220024 | 10.5% |
| Space Separator | 110012 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 323126 | |
| 0 | 320225 | |
| 2 | 277020 | |
| 8 | 129261 | |
| 3 | 101890 | 6.6% |
| 7 | 101623 | 6.6% |
| 4 | 95107 | 6.2% |
| 5 | 89012 | 5.8% |
| 6 | 55431 | 3.6% |
| 9 | 47473 | 3.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 220024 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 220024 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 110012 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2090228 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 323126 | |
| 0 | 320225 | |
| 2 | 277020 | |
| - | 220024 | |
| : | 220024 | |
| 8 | 129261 | |
| (space) | 110012 | 5.3% |
| 3 | 101890 | 4.9% |
| 7 | 101623 | 4.9% |
| 4 | 95107 | 4.6% |
| Other values (3) | 191916 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2090228 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 323126 | |
| 0 | 320225 | |
| 2 | 277020 | |
| - | 220024 | |
| : | 220024 | |
| 8 | 129261 | |
| (space) | 110012 | 5.3% |
| 3 | 101890 | 4.9% |
| 7 | 101623 | 4.9% |
| 4 | 95107 | 4.6% |
| Other values (3) | 191916 |
order_estimated_delivery_date
Categorical
| Distinct | 450 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 2135068 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 19 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2017-06-05 00:00:00 |
|---|---|
| 2nd row | 2017-12-19 00:00:00 |
| 3rd row | 2017-12-19 00:00:00 |
| 4th row | 2017-06-12 00:00:00 |
| 5th row | 2018-03-22 00:00:00 |
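The delivery-related columns are profiled as Categorical strings, so no durations can be read off the report directly. As a minimal sketch, the snippet below parses the "1st row" samples of three of these columns with the standard library and compares them. It assumes that the "1st row" samples all belong to the same record and that the format is `%Y-%m-%d %H:%M:%S`; both are inferences from the samples, and the helper name `parse` is illustrative.

```python
from datetime import datetime

# Format inferred from the 19-character sample values.
TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S"


def parse(value: str) -> datetime:
    """Convert a report-style timestamp string into a datetime."""
    return datetime.strptime(value, TIMESTAMP_FORMAT)


# "1st row" samples, assumed to describe the same record.
carrier = parse("2017-05-23 10:47:57")    # order_delivered_carrier_date
delivered = parse("2017-05-25 10:35:35")  # order_delivered_customer_date
estimated = parse("2017-06-05 00:00:00")  # order_estimated_delivery_date

transit_time = delivered - carrier
early_by = estimated - delivered

print("Carrier to customer:", transit_time)
print("Delivered ahead of estimate by:", early_by)
```

Converting these columns to a datetime type before profiling would likely replace the character-level statistics with a date range and histogram, which are usually more informative for timestamps.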
Common Values
| Value | Count | Frequency (%) |
| 2017-12-20 00:00:00 | 607 | 0.5% |
| 2018-03-12 00:00:00 | 593 | 0.5% |
| 2018-05-29 00:00:00 | 591 | 0.5% |
| 2018-03-13 00:00:00 | 589 | 0.5% |
| 2018-07-05 00:00:00 | 571 | 0.5% |
| 2017-12-18 00:00:00 | 565 | 0.5% |
| 2017-12-19 00:00:00 | 565 | 0.5% |
| 2018-05-28 00:00:00 | 563 | 0.5% |
| 2018-02-14 00:00:00 | 558 | 0.5% |
| 2018-05-30 00:00:00 | 557 | 0.5% |
| Other values (440) | 106613 |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| 00:00:00 | 112372 | |
| 2017-12-20 | 607 | 0.3% |
| 2018-03-12 | 593 | 0.3% |
| 2018-05-29 | 591 | 0.3% |
| 2018-03-13 | 589 | 0.3% |
| 2018-07-05 | 571 | 0.3% |
| 2017-12-18 | 565 | 0.3% |
| 2017-12-19 | 565 | 0.3% |
| 2018-05-28 | 563 | 0.3% |
| 2018-02-14 | 558 | 0.2% |
| Other values (441) | 107170 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 929457 | |
| - | 224744 | 10.5% |
| : | 224744 | 10.5% |
| 1 | 192328 | 9.0% |
| 2 | 175291 | 8.2% |
| (space) | 112372 | 5.3% |
| 8 | 93359 | 4.4% |
| 7 | 68148 | 3.2% |
| 3 | 30055 | 1.4% |
| 5 | 23107 | 1.1% |
| Other values (3) | 61463 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1573208 | |
| Dash Punctuation | 224744 | 10.5% |
| Other Punctuation | 224744 | 10.5% |
| Space Separator | 112372 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 929457 | |
| 1 | 192328 | 12.2% |
| 2 | 175291 | 11.1% |
| 8 | 93359 | 5.9% |
| 7 | 68148 | 4.3% |
| 3 | 30055 | 1.9% |
| 5 | 23107 | 1.5% |
| 6 | 21757 | 1.4% |
| 4 | 21124 | 1.3% |
| 9 | 18582 | 1.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 224744 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 224744 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 112372 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2135068 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 929457 | |
| - | 224744 | 10.5% |
| : | 224744 | 10.5% |
| 1 | 192328 | 9.0% |
| 2 | 175291 | 8.2% |
| (space) | 112372 | 5.3% |
| 8 | 93359 | 4.4% |
| 7 | 68148 | 3.2% |
| 3 | 30055 | 1.4% |
| 5 | 23107 | 1.1% |
| Other values (3) | 61463 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2135068 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 929457 | |
| - | 224744 | 10.5% |
| : | 224744 | 10.5% |
| 1 | 192328 | 9.0% |
| 2 | 175291 | 8.2% |
| (space) | 112372 | 5.3% |
| 8 | 93359 | 4.4% |
| 7 | 68148 | 3.2% |
| 3 | 30055 | 1.4% |
| 5 | 23107 | 1.1% |
| Other values (3) | 61463 | 2.9% |
order_item_id
Real number (ℝ)
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1960097 |
| Minimum | 1 |
|---|---|
| Maximum | 21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 21 |
| Range | 20 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.69124255 |
|---|---|
| Coefficient of variation (CV) | 0.57795732 |
| Kurtosis | 92.983464 |
| Mean | 1.1960097 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.1912707 |
| Sum | 134398 |
| Variance | 0.47781626 |
| Monotonicity | Not monotonic |
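Several of the descriptive statistics for order_item_id are derived from one another, which gives a quick way to sanity-check the table. The sketch below recomputes them from the reported mean, standard deviation, quantiles, and the dataset's observation count (112372). Variable names are illustrative, and small differences in the last digit are expected because the inputs are rounded.

```python
# Values copied from the order_item_id summary.
n_observations = 112372
mean = 1.1960097
std = 0.69124255
q1, q3 = 1, 1
minimum, maximum = 1, 21

# Relationships between the reported statistics.
coefficient_of_variation = std / mean  # reported: 0.57795732
variance = std ** 2                    # reported: 0.47781626
total = mean * n_observations          # reported Sum: 134398
value_range = maximum - minimum        # reported Range: 20
iqr = q3 - q1                          # reported IQR: 0

print(f"CV       = {coefficient_of_variation:.8f}")
print(f"Variance = {variance:.8f}")
print(f"Sum      = {total:.0f}")
print(f"Range    = {value_range}, IQR = {iqr}")
```

An IQR of 0 alongside a maximum of 21 reflects how concentrated the column is at 1, which is also why the skewness and kurtosis are so large.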
Common values
| Value | Count | Frequency (%) |
| 1 | 98465 | 87.6% |
| 2 | 9766 | 8.7% |
| 3 | 2270 | 2.0% |
| 4 | 959 | 0.9% |
| 5 | 456 | 0.4% |
| 6 | 252 | 0.2% |
| 7 | 58 | 0.1% |
| 8 | 36 | < 0.1% |
| 9 | 28 | < 0.1% |
| 10 | 25 | < 0.1% |
| Other values (11) | 57 | 0.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1 | 98465 | 87.6% |
| 2 | 9766 | 8.7% |
| 3 | 2270 | 2.0% |
| 4 | 959 | 0.9% |
| 5 | 456 | 0.4% |
| 6 | 252 | 0.2% |
| 7 | 58 | 0.1% |
| 8 | 36 | < 0.1% |
| 9 | 28 | < 0.1% |
| 10 | 25 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 2 | < 0.1% |
| 19 | 2 | < 0.1% |
| 18 | 2 | < 0.1% |
| 17 | 2 | < 0.1% |
| 16 | 2 | < 0.1% |
| 15 | 4 | < 0.1% |
| 14 | 6 | < 0.1% |
| 13 | 7 | < 0.1% |
| 12 | 12 | < 0.1% |
product_id
Categorical
| Distinct | 32789 |
|---|---|
| Distinct (%) | 29.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3595904 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 17949 |
|---|---|
| Unique (%) | 16.0% |
Sample
| 1st row | a9516a079e37a9c9c36b9b78b10169e8 |
|---|---|
| 2nd row | a9516a079e37a9c9c36b9b78b10169e8 |
| 3rd row | a9516a079e37a9c9c36b9b78b10169e8 |
| 4th row | a9516a079e37a9c9c36b9b78b10169e8 |
| 5th row | a9516a079e37a9c9c36b9b78b10169e8 |
Common Values
| Value | Count | Frequency (%) |
| aca2eb7d00ea1a7b8ebd4e68314663af | 524 | 0.5% |
| 422879e10f46682990de24d770e7f83d | 486 | 0.4% |
| 99a4788cb24856965c36a24e339b6058 | 482 | 0.4% |
| 389d119b48cf3043d311335e499d9c6b | 391 | 0.3% |
| 368c6c730842d78016ad823897a372db | 388 | 0.3% |
| 53759a2ecddad2bb87a079a1f1519f73 | 373 | 0.3% |
| d1c427060a0f73f6b889a5c7c61f2ac4 | 340 | 0.3% |
| 53b36df67ebb7c41585e8d54d6772e08 | 320 | 0.3% |
| 154e7e31ebfa092203795c972e5804a6 | 292 | 0.3% |
| 3dd2a17168ec895c781a9191c1e95ad7 | 272 | 0.2% |
| Other values (32779) | 108504 |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| aca2eb7d00ea1a7b8ebd4e68314663af | 524 | 0.5% |
| 422879e10f46682990de24d770e7f83d | 486 | 0.4% |
| 99a4788cb24856965c36a24e339b6058 | 482 | 0.4% |
| 389d119b48cf3043d311335e499d9c6b | 391 | 0.3% |
| 368c6c730842d78016ad823897a372db | 388 | 0.3% |
| 53759a2ecddad2bb87a079a1f1519f73 | 373 | 0.3% |
| d1c427060a0f73f6b889a5c7c61f2ac4 | 340 | 0.3% |
| 53b36df67ebb7c41585e8d54d6772e08 | 320 | 0.3% |
| 154e7e31ebfa092203795c972e5804a6 | 292 | 0.3% |
| 3dd2a17168ec895c781a9191c1e95ad7 | 272 | 0.2% |
| Other values (32779) | 108504 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 231245 | 6.4% |
| 9 | 228876 | 6.4% |
| e | 226947 | 6.3% |
| 7 | 226446 | 6.3% |
| 8 | 226313 | 6.3% |
| 4 | 225802 | 6.3% |
| a | 225307 | 6.3% |
| c | 224539 | 6.2% |
| 0 | 224455 | 6.2% |
| 2 | 224352 | 6.2% |
| Other values (6) | 1331622 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2257039 | |
| Lowercase Letter | 1338865 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 231245 | |
| 9 | 228876 | |
| 7 | 226446 | |
| 8 | 226313 | |
| 4 | 225802 | |
| 0 | 224455 | |
| 2 | 224352 | |
| 6 | 223663 | |
| 5 | 223639 | |
| 1 | 222248 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 226947 | |
| a | 225307 | |
| c | 224539 | |
| b | 223178 | |
| d | 221055 | |
| f | 217839 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2257039 | |
| Latin | 1338865 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 231245 | |
| 9 | 228876 | |
| 7 | 226446 | |
| 8 | 226313 | |
| 4 | 225802 | |
| 0 | 224455 | |
| 2 | 224352 | |
| 6 | 223663 | |
| 5 | 223639 | |
| 1 | 222248 |
Latin
| Value | Count | Frequency (%) |
| e | 226947 | |
| a | 225307 | |
| c | 224539 | |
| b | 223178 | |
| d | 221055 | |
| f | 217839 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3595904 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 231245 | 6.4% |
| 9 | 228876 | 6.4% |
| e | 226947 | 6.3% |
| 7 | 226446 | 6.3% |
| 8 | 226313 | 6.3% |
| 4 | 225802 | 6.3% |
| a | 225307 | 6.3% |
| c | 224539 | 6.2% |
| 0 | 224455 | 6.2% |
| 2 | 224352 | 6.2% |
| Other values (6) | 1331622 |
seller_id
Categorical
| Distinct | 3090 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3595904 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 504 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 7c67e1448b00f6e969d365cea6b010ab |
|---|---|
| 2nd row | 7c67e1448b00f6e969d365cea6b010ab |
| 3rd row | 7c67e1448b00f6e969d365cea6b010ab |
| 4th row | 7c67e1448b00f6e969d365cea6b010ab |
| 5th row | 7c67e1448b00f6e969d365cea6b010ab |
Common Values
| Value | Count | Frequency (%) |
| 6560211a19b47992c3666cc44a7e94c0 | 2020 | 1.8% |
| 4a3ca9315b744ce9f8e9374361493884 | 1984 | 1.8% |
| 1f50f920176fa81dab994f9023523100 | 1932 | 1.7% |
| cc419e0650a3c5ba77189a1882b7556a | 1811 | 1.6% |
| da8622b14eb17ae2831f4ac5b9dab84a | 1568 | 1.4% |
| 955fee9216a65b617aa5c0531780ce60 | 1489 | 1.3% |
| 1025f0e2d44d7041d6cf58b6550e0bfa | 1431 | 1.3% |
| 7c67e1448b00f6e969d365cea6b010ab | 1367 | 1.2% |
| ea8482cd71df3c1969d7b9473ff13abc | 1197 | 1.1% |
| 7a67c85e85bb2ce8582c35f2203ad736 | 1166 | 1.0% |
| Other values (3080) | 96407 |
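The Common Values table for seller_id lists per-seller counts but leaves the combined shares implicit. A small sketch of the arithmetic, using only the counts in the table and the dataset's 112372 observations (variable names are illustrative):

```python
# Counts copied from the seller_id Common Values table.
N_OBSERVATIONS = 112372
top_10_counts = [2020, 1984, 1932, 1811, 1568, 1489, 1431, 1367, 1197, 1166]

top_10_total = sum(top_10_counts)
other_total = N_OBSERVATIONS - top_10_total  # should match "Other values (3080)"

top_10_share = 100 * top_10_total / N_OBSERVATIONS
other_share = 100 * other_total / N_OBSERVATIONS

print(f"Top 10 sellers: {top_10_total} rows ({top_10_share:.1f}%)")
print(f"Other sellers:  {other_total} rows ({other_share:.1f}%)")
```

The remainder reproduces the 96407 rows reported for the other 3080 sellers, so the table accounts for every row in the dataset.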
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| 6560211a19b47992c3666cc44a7e94c0 | 2020 | 1.8% |
| 4a3ca9315b744ce9f8e9374361493884 | 1984 | 1.8% |
| 1f50f920176fa81dab994f9023523100 | 1932 | 1.7% |
| cc419e0650a3c5ba77189a1882b7556a | 1811 | 1.6% |
| da8622b14eb17ae2831f4ac5b9dab84a | 1568 | 1.4% |
| 955fee9216a65b617aa5c0531780ce60 | 1489 | 1.3% |
| 1025f0e2d44d7041d6cf58b6550e0bfa | 1431 | 1.3% |
| 7c67e1448b00f6e969d365cea6b010ab | 1367 | 1.2% |
| ea8482cd71df3c1969d7b9473ff13abc | 1197 | 1.1% |
| 7a67c85e85bb2ce8582c35f2203ad736 | 1166 | 1.0% |
| Other values (3080) | 96407 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 243702 | 6.8% |
| c | 237265 | 6.6% |
| 4 | 235579 | 6.6% |
| 6 | 231538 | 6.4% |
| 0 | 230731 | 6.4% |
| a | 229324 | 6.4% |
| b | 228696 | 6.4% |
| 3 | 228492 | 6.4% |
| 9 | 222943 | 6.2% |
| 2 | 222078 | 6.2% |
| Other values (6) | 1285556 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2273806 | |
| Lowercase Letter | 1322098 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 243702 | |
| 4 | 235579 | |
| 6 | 231538 | |
| 0 | 230731 | |
| 3 | 228492 | |
| 9 | 222943 | |
| 2 | 222078 | |
| 8 | 219820 | |
| 5 | 219763 | |
| 7 | 219160 |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 237265 | |
| a | 229324 | |
| b | 228696 | |
| e | 211716 | |
| f | 208617 | |
| d | 206480 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2273806 | |
| Latin | 1322098 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 243702 | |
| 4 | 235579 | |
| 6 | 231538 | |
| 0 | 230731 | |
| 3 | 228492 | |
| 9 | 222943 | |
| 2 | 222078 | |
| 8 | 219820 | |
| 5 | 219763 | |
| 7 | 219160 |
Latin
| Value | Count | Frequency (%) |
| c | 237265 | |
| a | 229324 | |
| b | 228696 | |
| e | 211716 | |
| f | 208617 | |
| d | 206480 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3595904 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 243702 | 6.8% |
| c | 237265 | 6.6% |
| 4 | 235579 | 6.6% |
| 6 | 231538 | 6.4% |
| 0 | 230731 | 6.4% |
| a | 229324 | 6.4% |
| b | 228696 | 6.4% |
| 3 | 228492 | 6.4% |
| 9 | 222943 | 6.2% |
| 2 | 222078 | 6.2% |
| Other values (6) | 1285556 |
shipping_limit_date
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 92643 |
|---|---|
| Distinct (%) | 82.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 2135068 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 78886 |
|---|---|
| Unique (%) | 70.2% |
Sample
| 1st row | 2017-05-22 15:22:12 |
|---|---|
| 2nd row | 2017-11-23 00:47:18 |
| 3rd row | 2017-11-23 00:47:18 |
| 4th row | 2017-05-22 22:22:56 |
| 5th row | 2018-02-18 21:29:19 |
Common Values
| Value | Count | Frequency (%) |
| 2017-10-24 13:06:21 | 22 | < 0.1% |
| 2018-03-01 02:50:48 | 21 | < 0.1% |
| 2017-07-21 18:25:23 | 21 | < 0.1% |
| 2017-12-21 02:30:41 | 15 | < 0.1% |
| 2017-02-03 21:44:49 | 15 | < 0.1% |
| 2017-11-30 10:30:51 | 15 | < 0.1% |
| 2018-02-28 11:48:12 | 14 | < 0.1% |
| 2018-06-13 17:30:35 | 13 | < 0.1% |
| 2018-04-25 22:11:43 | 13 | < 0.1% |
| 2018-04-19 02:30:52 | 13 | < 0.1% |
| Other values (92633) | 112210 |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| 2017-11-30 | 1645 | 0.7% |
| 2017-12-07 | 756 | 0.3% |
| 2018-04-19 | 705 | 0.3% |
| 2018-03-08 | 662 | 0.3% |
| 2018-05-10 | 658 | 0.3% |
| 2018-01-18 | 657 | 0.3% |
| 2018-03-01 | 652 | 0.3% |
| 2018-08-07 | 649 | 0.3% |
| 2018-02-22 | 644 | 0.3% |
| 2018-03-22 | 627 | 0.3% |
| Other values (40510) | 217089 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 360810 | |
| 1 | 345224 | |
| 2 | 275312 | |
| - | 224744 | |
| : | 224744 | |
| 8 | 113258 | 5.3% |
| (space) | 112372 | 5.3% |
| 3 | 108600 | 5.1% |
| 5 | 106431 | 5.0% |
| 7 | 96448 | 4.5% |
| Other values (3) | 167125 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1573208 | |
| Dash Punctuation | 224744 | 10.5% |
| Other Punctuation | 224744 | 10.5% |
| Space Separator | 112372 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 360810 | |
| 1 | 345224 | |
| 2 | 275312 | |
| 8 | 113258 | 7.2% |
| 3 | 108600 | 6.9% |
| 5 | 106431 | 6.8% |
| 7 | 96448 | 6.1% |
| 4 | 75487 | 4.8% |
| 6 | 47066 | 3.0% |
| 9 | 44572 | 2.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 224744 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 224744 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 112372 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2135068 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 360810 | |
| 1 | 345224 | |
| 2 | 275312 | |
| - | 224744 | |
| : | 224744 | |
| 8 | 113258 | 5.3% |
| (space) | 112372 | 5.3% |
| 3 | 108600 | 5.1% |
| 5 | 106431 | 5.0% |
| 7 | 96448 | 4.5% |
| Other values (3) | 167125 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2135068 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 360810 | |
| 1 | 345224 | |
| 2 | 275312 | |
| - | 224744 | |
| : | 224744 | |
| 8 | 113258 | 5.3% |
| (space) | 112372 | 5.3% |
| 3 | 108600 | 5.1% |
| 5 | 106431 | 5.0% |
| 7 | 96448 | 4.5% |
| Other values (3) | 167125 |
price
Real number (ℝ)
| Distinct | 5948 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 120.37896 |
| Minimum | 0.85 |
|---|---|
| Maximum | 6735 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0.85 |
|---|---|
| 5-th percentile | 17 |
| Q1 | 39.9 |
| median | 74.9 |
| Q3 | 134.9 |
| 95-th percentile | 349.9 |
| Maximum | 6735 |
| Range | 6734.15 |
| Interquartile range (IQR) | 95 |
Descriptive statistics
| Standard deviation | 182.15239 |
|---|---|
| Coefficient of variation (CV) | 1.513158 |
| Kurtosis | 109.42271 |
| Mean | 120.37896 |
| Median Absolute Deviation (MAD) | 42 |
| Skewness | 7.6696487 |
| Sum | 13527225 |
| Variance | 33179.492 |
| Monotonicity | Not monotonic |
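The price quantiles show a strong right tail (skewness 7.67, maximum 6735 against a median of 74.9). One common way to quantify it is Tukey's 1.5 × IQR rule. The report does not apply this rule itself, so the sketch below is only an illustration built on the reported quantiles, with illustrative variable names.

```python
# Quantiles copied from the price summary.
q1, q3 = 39.9, 134.9
p95 = 349.9
minimum = 0.85

iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# The lower fence only matters if it lies above the smallest observed price.
lower_fence_is_binding = lower_fence > minimum

# If the 95th percentile already exceeds the upper fence, roughly 5% or more
# of the rows would be flagged as high outliers by this rule.
p95_above_fence = p95 > upper_fence

print(f"IQR = {iqr:.2f}")
print(f"Fences = [{lower_fence:.2f}, {upper_fence:.2f}]")
print("Lower fence binding:", lower_fence_is_binding)
print("95th percentile above upper fence:", p95_above_fence)
```

Because so many rows fall beyond the upper fence, a log transform or a robust scaler may suit this column better than dropping flagged rows.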
Common values
| Value | Count | Frequency (%) |
| 59.9 | 2472 | 2.2% |
| 69.9 | 2000 | 1.8% |
| 49.9 | 1946 | 1.7% |
| 89.9 | 1545 | 1.4% |
| 99.9 | 1429 | 1.3% |
| 39.9 | 1325 | 1.2% |
| 29.9 | 1319 | 1.2% |
| 79.9 | 1209 | 1.1% |
| 19.9 | 1194 | 1.1% |
| 29.99 | 1171 | 1.0% |
| Other values (5938) | 96762 | 86.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0.85 | 3 | < 0.1% |
| 1.2 | 20 | < 0.1% |
| 2.2 | 1 | < 0.1% |
| 2.29 | 1 | < 0.1% |
| 2.9 | 1 | < 0.1% |
| 2.99 | 1 | < 0.1% |
| 3 | 2 | < 0.1% |
| 3.06 | 3 | < 0.1% |
| 3.49 | 3 | < 0.1% |
| 3.5 | 7 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 6735 | 1 | < 0.1% |
| 6499 | 1 | < 0.1% |
| 4799 | 1 | < 0.1% |
| 4690 | 1 | < 0.1% |
| 4590 | 1 | < 0.1% |
| 4399.87 | 1 | < 0.1% |
| 4099.99 | 1 | < 0.1% |
| 4059 | 1 | < 0.1% |
| 3999.9 | 1 | < 0.1% |
| 3999 | 1 | < 0.1% |
freight_value
Real number (ℝ)
| Distinct | 6976 |
|---|---|
| Distinct (%) | 6.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.977752 |
| Minimum | 0 |
|---|---|
| Maximum | 409.68 |
| Zeros | 382 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7.78 |
| Q1 | 13.07 |
| median | 16.25 |
| Q3 | 21.15 |
| 95-th percentile | 45.12 |
| Maximum | 409.68 |
| Range | 409.68 |
| Interquartile range (IQR) | 8.08 |
Descriptive statistics
| Standard deviation | 15.781421 |
|---|---|
| Coefficient of variation (CV) | 0.78994982 |
| Kurtosis | 60.037405 |
| Mean | 19.977752 |
| Median Absolute Deviation (MAD) | 3.6 |
| Skewness | 5.6452126 |
| Sum | 2244939.9 |
| Variance | 249.05326 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 15.1 | 3700 | 3.3% |
| 7.78 | 2255 | 2.0% |
| 14.1 | 1879 | 1.7% |
| 11.85 | 1861 | 1.7% |
| 18.23 | 1572 | 1.4% |
| 7.39 | 1521 | 1.4% |
| 16.11 | 1159 | 1.0% |
| 15.23 | 1005 | 0.9% |
| 8.72 | 923 | 0.8% |
| 16.79 | 869 | 0.8% |
| Other values (6966) | 95628 | 85.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 382 | 0.3% |
| 0.01 | 4 | < 0.1% |
| 0.02 | 3 | < 0.1% |
| 0.03 | 14 | < 0.1% |
| 0.04 | 4 | < 0.1% |
| 0.05 | 4 | < 0.1% |
| 0.06 | 11 | < 0.1% |
| 0.07 | 1 | < 0.1% |
| 0.08 | 12 | < 0.1% |
| 0.09 | 6 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 409.68 | 1 | < 0.1% |
| 375.28 | 2 | < 0.1% |
| 339.59 | 1 | < 0.1% |
| 338.3 | 1 | < 0.1% |
| 322.1 | 1 | < 0.1% |
| 321.88 | 1 | < 0.1% |
| 321.46 | 1 | < 0.1% |
| 317.47 | 1 | < 0.1% |
| 314.4 | 1 | < 0.1% |
| 314.02 | 1 | < 0.1% |
review_id
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 97709 |
|---|---|
| Distinct (%) | 87.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3595904 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 87398 |
|---|---|
| Unique (%) | 77.8% |
Sample
| 1st row | 88b8b52d46df026a9d1ad2136a59b30b |
|---|---|
| 2nd row | 7fc63200f12eebb5f387856afdd63db8 |
| 3rd row | 7fc63200f12eebb5f387856afdd63db8 |
| 4th row | 251191809e37c1cffc16865947c18a4d |
| 5th row | f7123bac5b91a0e2e38d8b41fd1206f4 |
Common Values
| Value | Count | Frequency (%) |
| e8236fe7b6e1bdd513a500de361e2b87 | 21 | < 0.1% |
| be332150a9c96e68c9565ea53cba2355 | 20 | < 0.1% |
| 2e3a6e4930334530774ac3a6f6b62388 | 15 | < 0.1% |
| d638a70f2be180ef55395eabb78fd88c | 15 | < 0.1% |
| 03129dea7c12fa5878b2e629ccdf2ce6 | 14 | < 0.1% |
| ee4bc8e340e8648a44c2e33fee6b27e4 | 14 | < 0.1% |
| ec530e1eb28f81c279430f63836c4c0c | 13 | < 0.1% |
| e8f500e8052dd5fac20fee5a8c880367 | 13 | < 0.1% |
| f676b4a89abc42681e4cd67dfb2621d5 | 12 | < 0.1% |
| 40b1644940367763775a63267ca6d957 | 12 | < 0.1% |
| Other values (97699) | 112223 |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| e8236fe7b6e1bdd513a500de361e2b87 | 21 | < 0.1% |
| be332150a9c96e68c9565ea53cba2355 | 20 | < 0.1% |
| 2e3a6e4930334530774ac3a6f6b62388 | 15 | < 0.1% |
| d638a70f2be180ef55395eabb78fd88c | 15 | < 0.1% |
| 03129dea7c12fa5878b2e629ccdf2ce6 | 14 | < 0.1% |
| ee4bc8e340e8648a44c2e33fee6b27e4 | 14 | < 0.1% |
| ec530e1eb28f81c279430f63836c4c0c | 13 | < 0.1% |
| e8f500e8052dd5fac20fee5a8c880367 | 13 | < 0.1% |
| fede3915bdef46df7b0101af03518b71 | 12 | < 0.1% |
| d3f6a183fd58d4afd1b44a1bb410d7c2 | 12 | < 0.1% |
| Other values (97699) | 112223 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 225614 | 6.3% |
| 6 | 225546 | 6.3% |
| 5 | 225186 | 6.3% |
| 8 | 225111 | 6.3% |
| b | 224976 | 6.3% |
| f | 224963 | 6.3% |
| 0 | 224784 | 6.3% |
| d | 224761 | 6.3% |
| 2 | 224734 | 6.2% |
| c | 224515 | 6.2% |
| Other values (6) | 1345714 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2246606 | |
| Lowercase Letter | 1349298 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 225546 | |
| 5 | 225186 | |
| 8 | 225111 | |
| 0 | 224784 | |
| 2 | 224734 | |
| 9 | 224417 | |
| 4 | 224305 | |
| 1 | 224298 | |
| 7 | 224240 | |
| 3 | 223985 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 225614 | |
| b | 224976 | |
| f | 224963 | |
| d | 224761 | |
| c | 224515 | |
| e | 224469 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2246606 | |
| Latin | 1349298 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 225546 | |
| 5 | 225186 | |
| 8 | 225111 | |
| 0 | 224784 | |
| 2 | 224734 | |
| 9 | 224417 | |
| 4 | 224305 | |
| 1 | 224298 | |
| 7 | 224240 | |
| 3 | 223985 |
Latin
| Value | Count | Frequency (%) |
| a | 225614 | |
| b | 224976 | |
| f | 224963 | |
| d | 224761 | |
| c | 224515 | |
| e | 224469 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3595904 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 225614 | 6.3% |
| 6 | 225546 | 6.3% |
| 5 | 225186 | 6.3% |
| 8 | 225111 | 6.3% |
| b | 224976 | 6.3% |
| f | 224963 | 6.3% |
| 0 | 224784 | 6.3% |
| d | 224761 | 6.3% |
| 2 | 224734 | 6.2% |
| c | 224515 | 6.2% |
| Other values (6) | 1345714 |
review_score
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 112372 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 3 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 5 | 63525 | 56.5% |
| 4 | 21315 | 19.0% |
| 1 | 14235 | 12.7% |
| 3 | 9423 | 8.4% |
| 2 | 3874 | 3.4% |
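The Frequency (%) column is each count divided by the number of observations (112372). A minimal sketch that recomputes it from the counts in the table above; the `frequency_table` helper is a hypothetical name used only for illustration and is not part of the profiling tool.

```python
# Counts copied from the review_score "Common Values" table.
counts = {"5": 63525, "4": 21315, "1": 14235, "3": 9423, "2": 3874}


def frequency_table(counts):
    """Return {value: percentage of all rows}, rounded to one decimal."""
    total = sum(counts.values())
    return {value: round(100 * n / total, 1) for value, n in counts.items()}


freq = frequency_table(counts)
for value, pct in freq.items():
    print(f"{value}: {pct}%")
```

Because review_score has no missing values, the counts sum to the full number of observations.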
[Figure: Length histogram]
[Figure: Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
| 5 | 63525 | 56.5% |
| 4 | 21315 | 19.0% |
| 1 | 14235 | 12.7% |
| 3 | 9423 | 8.4% |
| 2 | 3874 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 63525 | 56.5% |
| 4 | 21315 | 19.0% |
| 1 | 14235 | 12.7% |
| 3 | 9423 | 8.4% |
| 2 | 3874 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 112372 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 63525 | 56.5% |
| 4 | 21315 | 19.0% |
| 1 | 14235 | 12.7% |
| 3 | 9423 | 8.4% |
| 2 | 3874 | 3.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 112372 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 63525 | 56.5% |
| 4 | 21315 | 19.0% |
| 1 | 14235 | 12.7% |
| 3 | 9423 | 8.4% |
| 2 | 3874 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 112372 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 63525 | 56.5% |
| 4 | 21315 | 19.0% |
| 1 | 14235 | 12.7% |
| 3 | 9423 | 8.4% |
| 2 | 3874 | 3.4% |
review_comment_title
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 4497 |
|---|---|
| Distinct (%) | 33.5% |
| Missing | 98938 |
| Missing (%) | 88.0% |
| Memory size | 1.7 MiB |
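The two percentages above use different denominators. The numbers are consistent with Missing (%) being taken over all observations and Distinct (%) over the non-missing values only; this is inferred from the figures in this report, not from the tool's documentation. A small sketch of that arithmetic:

```python
n_rows = 112372      # observations in the dataset
n_missing = 98938    # missing review_comment_title values
n_distinct = 4497    # distinct non-missing titles

# Rows that actually carry a title.
n_present = n_rows - n_missing

# Missing share is relative to every row.
missing_pct = round(100 * n_missing / n_rows, 1)

# Distinct share is relative to the non-missing rows (assumption
# inferred from the reported 33.5%).
distinct_pct = round(100 * n_distinct / n_present, 1)

print(n_present, missing_pct, distinct_pct)
```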
| Recomendo | 471 |
|---|---|
| recomendo | 392 |
| Bom | 322 |
| super recomendo | 307 |
| Excelente | 281 |
| Other values (4492) | 11661 |
Length
| Max length | 26 |
|---|---|
| Median length | 20 |
| Mean length | 12.172994 |
| Min length | 1 |
Characters and Unicode
| Total characters | 163532 |
|---|---|
| Distinct characters | 125 |
| Distinct categories | 14 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 3167 |
|---|---|
| Unique (%) | 23.6% |
Sample
| 1st row | Produto ruim |
|---|---|
| 2nd row | Negativa |
| 3rd row | Negativa |
| 4th row | preciso de ajuda |
| 5th row | preciso de ajuda |
Common Values
| Value | Count | Frequency (%) |
| Recomendo | 471 | 0.4% |
| recomendo | 392 | 0.3% |
| Bom | 322 | 0.3% |
| super recomendo | 307 | 0.3% |
| Excelente | 281 | 0.3% |
| Muito bom | 268 | 0.2% |
| Ótimo | 265 | 0.2% |
| Super recomendo | 251 | 0.2% |
| Ótimo | 231 | 0.2% |
| Otimo | 198 | 0.2% |
| Other values (4487) | 10448 | 9.3% |
| (Missing) | 98938 | 88.0% |
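Several rows in this table differ only by letter case or accents ("Recomendo" / "recomendo", "Ótimo" / "Otimo"), which inflates the distinct count. The following is a sketch of one possible normalization, assuming that case and diacritics carry no meaning for these titles; `normalize_title` is a hypothetical helper, and the report does not say why "Ótimo" appears twice, so that row is left out of the sample.

```python
import unicodedata
from collections import Counter


def normalize_title(text):
    """Casefold, strip accents, and collapse whitespace."""
    # NFKD splits accented letters into base letter + combining mark.
    decomposed = unicodedata.normalize("NFKD", text)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(no_accents.casefold().split())


# (title, count) pairs copied from the "Common Values" table.
sample = [
    ("Recomendo", 471),
    ("recomendo", 392),
    ("super recomendo", 307),
    ("Super recomendo", 251),
    ("Ótimo", 265),
    ("Otimo", 198),
]

merged = Counter()
for title, n in sample:
    merged[normalize_title(title)] += n

print(merged.most_common())
```

Whether accent stripping is acceptable depends on the downstream use; for Portuguese text it can merge words that are genuinely different.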
[Figure: Length histogram]
Words
| Value | Count | Frequency (%) |
| recomendo | 2389 | 9.3% |
| produto | 1481 | 5.8% |
| bom | 1462 | 5.7% |
| super | 1019 | 4.0% |
| muito | 1006 | 3.9% |
| não | 878 | 3.4% |
| ótimo | 794 | 3.1% |
| excelente | 739 | 2.9% |
| entrega | 674 | 2.6% |
| recebi | 420 | 1.6% |
| Other values (2083) | 14758 | 57.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 20356 | 12.4% |
| e | 17478 | 10.7% |
| (space) | 14516 | 8.9% |
| r | 9423 | 5.8% |
| t | 9043 | 5.5% |
| a | 8760 | 5.4% |
| m | 8103 | 5.0% |
| d | 7864 | 4.8% |
| i | 7763 | 4.7% |
| n | 7343 | 4.5% |
| Other values (115) | 52883 | 32.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 127012 | 77.7% |
| Uppercase Letter | 18080 | 11.1% |
| Space Separator | 14516 | 8.9% |
| Other Punctuation | 2600 | 1.6% |
| Decimal Number | 1224 | 0.7% |
| Other Symbol | 47 | < 0.1% |
| Dash Punctuation | 19 | < 0.1% |
| Modifier Symbol | 13 | < 0.1% |
| Math Symbol | 8 | < 0.1% |
| Close Punctuation | 6 | < 0.1% |
| Other values (4) | 7 | < 0.1% |
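The category names in this table are Unicode general categories. A minimal sketch of how such a tally can be reproduced with Python's standard `unicodedata` module; the `CATEGORY_NAMES` mapping and the `category_counts` helper are illustrative assumptions, and the two sample strings are not the real column.

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general category codes and the long names used above.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Nd": "Decimal Number",
    "So": "Other Symbol",
    "Pd": "Dash Punctuation",
    "Sk": "Modifier Symbol",
    "Sm": "Math Symbol",
    "Pe": "Close Punctuation",
}


def category_counts(texts):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in texts:
        for ch in text:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


result = category_counts(["Muito bom!", "super recomendo"])
print(result)
```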
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 20356 | 16.0% |
| e | 17478 | 13.8% |
| r | 9423 | 7.4% |
| t | 9043 | 7.1% |
| a | 8760 | 6.9% |
| m | 8103 | 6.4% |
| d | 7864 | 6.2% |
| i | 7763 | 6.1% |
| n | 7343 | 5.8% |
| c | 5688 | 4.5% |
| Other values (31) | 25191 | 19.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2415 | 13.4% |
| R | 1947 | 10.8% |
| O | 1608 | 8.9% |
| P | 1529 | 8.5% |
| M | 1397 | 7.7% |
| N | 1138 | 6.3% |
| S | 1012 | 5.6% |
| A | 935 | 5.2% |
| Ó | 915 | 5.1% |
| B | 864 | 4.8% |
| Other values (26) | 4320 | 23.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| ! | 1166 | 44.8% |
| . | 757 | 29.1% |
| * | 404 | 15.5% |
| , | 146 | 5.6% |
| ? | 49 | 1.9% |
| / | 39 | 1.5% |
| % | 21 | 0.8% |
| : | 6 | 0.2% |
| " | 3 | 0.1% |
| ; | 3 | 0.1% |
| Other values (4) | 6 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 460 | 37.6% |
| 1 | 409 | 33.4% |
| 5 | 81 | 6.6% |
| 2 | 77 | 6.3% |
| 8 | 46 | 3.8% |
| 3 | 40 | 3.3% |
| 4 | 37 | 3.0% |
| 9 | 29 | 2.4% |
| 7 | 24 | 2.0% |
| 6 | 21 | 1.7% |
Other Symbol
| Value | Count | Frequency (%) |
| 👍 | 16 | 34.0% |
| 😍 | 9 | 19.1% |
| 👏 | 6 | 12.8% |
| 🌟 | 6 | 12.8% |
| 💥 | 5 | 10.6% |
| 👎 | 1 | 2.1% |
| 🔟 | 1 | 2.1% |
| 🚚 | 1 | 2.1% |
| 🤗 | 1 | 2.1% |
| 😀 | 1 | 2.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 6 | 46.2% |
| 🏻 | 3 | 23.1% |
| 🏼 | 2 | 15.4% |
| 🏽 | 2 | 15.4% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 7 | 87.5% |
| = | 1 | 12.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 | 83.3% |
| ] | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 14516 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 | 100.0% |
Other Letter
| Value | Count | Frequency (%) |
| ª | 1 | 100.0% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 145093 | 88.7% |
| Common | 18439 | 11.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 20356 | 14.0% |
| e | 17478 | 12.0% |
| r | 9423 | 6.5% |
| t | 9043 | 6.2% |
| a | 8760 | 6.0% |
| m | 8103 | 5.6% |
| d | 7864 | 5.4% |
| i | 7763 | 5.4% |
| n | 7343 | 5.1% |
| c | 5688 | 3.9% |
| Other values (68) | 43272 | 29.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 14516 | 78.7% |
| ! | 1166 | 6.3% |
| . | 757 | 4.1% |
| 0 | 460 | 2.5% |
| 1 | 409 | 2.2% |
| * | 404 | 2.2% |
| , | 146 | 0.8% |
| 5 | 81 | 0.4% |
| 2 | 77 | 0.4% |
| ? | 49 | 0.3% |
| Other values (37) | 374 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 160022 | 97.9% |
| None | 3500 | 2.1% |
| Emoticons | 10 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 20356 | 12.7% |
| e | 17478 | 10.9% |
| (space) | 14516 | 9.1% |
| r | 9423 | 5.9% |
| t | 9043 | 5.7% |
| a | 8760 | 5.5% |
| m | 8103 | 5.1% |
| d | 7864 | 4.9% |
| i | 7763 | 4.9% |
| n | 7343 | 4.6% |
| Other values (75) | 49373 | 30.9% |
None
| Value | Count | Frequency (%) |
| ã | 1063 | 30.4% |
| Ó | 915 | 26.1% |
| á | 348 | 9.9% |
| ç | 334 | 9.5% |
| ó | 245 | 7.0% |
| é | 243 | 6.9% |
| Ã | 69 | 2.0% |
| í | 64 | 1.8% |
| ê | 43 | 1.2% |
| É | 28 | 0.8% |
| Other values (28) | 148 | 4.2% |
Emoticons
| Value | Count | Frequency (%) |
| 😍 | 9 | 90.0% |
| 😀 | 1 | 10.0% |
review_comment_message
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 35692 |
|---|---|
| Distinct (%) | 74.9% |
| Missing | 64730 |
| Missing (%) | 57.6% |
| Memory size | 1.7 MiB |
| Muito bom | 254 |
|---|---|
| Bom | 200 |
| muito bom | 134 |
| bom | 117 |
| Otimo | 112 |
| Other values (35687) | 46825 |
Length
| Max length | 208 |
|---|---|
| Median length | 159 |
| Mean length | 70.224844 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3345652 |
|---|---|
| Distinct characters | 208 |
| Distinct categories | 15 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 30107 |
|---|---|
| Unique (%) | 63.2% |
Sample
| 1st row | GOSTARIA DE UMA SOLUÇÃO, ESTOU PRECISANDO MUITO DO PRODUTO. |
|---|---|
| 2nd row | GOSTARIA DE UMA SOLUÇÃO, ESTOU PRECISANDO MUITO DO PRODUTO. |
| 3rd row | Produto compatível com seu valor, muito bonito e barato, simples, mas um bom custo benefício. |
| 4th row | Entregou antes do prazo |
| 5th row | Os encaixes para o encosto da cadeira estavam desalinhados. Deu trabalho pra encaixar e finalmente montar o produto. |
Common Values
| Value | Count | Frequency (%) |
| Muito bom | 254 | 0.2% |
| Bom | 200 | 0.2% |
| muito bom | 134 | 0.1% |
| bom | 117 | 0.1% |
| Otimo | 112 | 0.1% |
| Recomendo | 109 | 0.1% |
| otimo | 102 | 0.1% |
| Ok | 85 | 0.1% |
| Ótimo | 83 | 0.1% |
| Ótimo | 82 | 0.1% |
| Other values (35682) | 46364 | 41.3% |
| (Missing) | 64730 | 57.6% |
[Figure: Length histogram]
Words
| Value | Count | Frequency (%) |
| o | 21460 | 3.8% |
| produto | 19929 | 3.5% |
| e | 18952 | 3.3% |
| a | 14323 | 2.5% |
| de | 13703 | 2.4% |
| do | 12468 | 2.2% |
| não | 12346 | 2.2% |
| que | 9780 | 1.7% |
| prazo | 9013 | 1.6% |
| muito | 8691 | 1.5% |
| Other values (19532) | 430230 | 75.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 529480 | 15.8% |
| o | 329883 | 9.9% |
| e | 320649 | 9.6% |
| a | 265180 | 7.9% |
| r | 188408 | 5.6% |
| i | 154357 | 4.6% |
| t | 152570 | 4.6% |
| d | 141384 | 4.2% |
| n | 129971 | 3.9% |
| s | 126390 | 3.8% |
| Other values (198) | 1007380 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2526996 | |
| Space Separator | 529480 | 15.8% |
| Uppercase Letter | 160240 | 4.8% |
| Other Punctuation | 91258 | 2.7% |
| Decimal Number | 20963 | 0.6% |
| Control | 13320 | 0.4% |
| Dash Punctuation | 937 | < 0.1% |
| Close Punctuation | 713 | < 0.1% |
| Open Punctuation | 697 | < 0.1% |
| Other Symbol | 675 | < 0.1% |
| Other values (5) | 373 | < 0.1% |
Most frequent character per category
Other Symbol
| Value | Count | Frequency (%) |
| 👏 | 233 | |
| 👍 | 86 | 12.7% |
| 😍 | 71 | 10.5% |
| ° | 27 | 4.0% |
| 😆 | 19 | 2.8% |
| 😉 | 19 | 2.8% |
| 😡 | 18 | 2.7% |
| 😘 | 16 | 2.4% |
| 👎 | 13 | 1.9% |
| 😁 | 13 | 1.9% |
| Other values (54) | 160 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 329883 | |
| e | 320649 | |
| a | 265180 | |
| r | 188408 | 7.5% |
| i | 154357 | 6.1% |
| t | 152570 | 6.0% |
| d | 141384 | 5.6% |
| n | 129971 | 5.1% |
| s | 126390 | 5.0% |
| m | 121057 | 4.8% |
| Other values (40) | 597147 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 18908 | |
| O | 17940 | |
| A | 16562 | |
| P | 11866 | 7.4% |
| R | 11640 | 7.3% |
| C | 9390 | 5.9% |
| M | 9055 | 5.7% |
| N | 9053 | 5.6% |
| S | 7939 | 5.0% |
| T | 7510 | 4.7% |
| Other values (31) | 40377 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 47977 | |
| , | 26579 | |
| ! | 12149 | 13.3% |
| / | 1753 | 1.9% |
| ? | 1550 | 1.7% |
| " | 412 | 0.5% |
| : | 296 | 0.3% |
| ; | 220 | 0.2% |
| % | 176 | 0.2% |
| * | 78 | 0.1% |
| Other values (5) | 68 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4828 | |
| 0 | 4793 | |
| 2 | 3997 | |
| 3 | 1892 | 9.0% |
| 4 | 1325 | 6.3% |
| 5 | 1183 | 5.6% |
| 8 | 910 | 4.3% |
| 6 | 874 | 4.2% |
| 7 | 705 | 3.4% |
| 9 | 456 | 2.2% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 84 | |
| = | 26 | 18.6% |
| \| | 12 | 8.6% |
| < | 10 | 7.1% |
| ~ | 3 | 2.1% |
| × | 2 | 1.4% |
| > | 2 | 1.4% |
| ÷ | 1 | 0.7% |
Modifier Symbol
| Value | Count | Frequency (%) |
| 🏻 | 34 | |
| ´ | 26 | |
| 🏼 | 15 | |
| 🏽 | 13 | 12.7% |
| ^ | 8 | 7.8% |
| 🏾 | 4 | 3.9% |
| ` | 2 | 2.0% |
Control
| Value | Count | Frequency (%) |
| (control character) | 6650 | 49.9% |
| (control character) | 6650 | 49.9% |
| (control character) | 20 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 710 | |
| ] | 3 | 0.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 691 | |
| [ | 6 | 0.9% |
Other Letter
| Value | Count | Frequency (%) |
| º | 26 | |
| ª | 19 |
Space Separator
| Value | Count | Frequency (%) |
| 529480 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 937 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 78 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2687281 | |
| Common | 658371 | 19.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 529480 | 80.4% |
| . | 47977 | 7.3% |
| , | 26579 | 4.0% |
| ! | 12149 | 1.8% |
| (control character) | 6650 | 1.0% |
| (control character) | 6650 | 1.0% |
| 1 | 4828 | 0.7% |
| 0 | 4793 | 0.7% |
| 2 | 3997 | 0.6% |
| 3 | 1892 | 0.3% |
| Other values (105) | 13376 | 2.0% |
Latin
| Value | Count | Frequency (%) |
| o | 329883 | |
| e | 320649 | |
| a | 265180 | 9.9% |
| r | 188408 | 7.0% |
| i | 154357 | 5.7% |
| t | 152570 | 5.7% |
| d | 141384 | 5.3% |
| n | 129971 | 4.8% |
| s | 126390 | 4.7% |
| m | 121057 | 4.5% |
| Other values (83) | 757432 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3285002 | |
| None | 60415 | 1.8% |
| Emoticons | 235 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 529480 | 16.1% |
| o | 329883 | 10.0% |
| e | 320649 | 9.8% |
| a | 265180 | 8.1% |
| r | 188408 | 5.7% |
| i | 154357 | 4.7% |
| t | 152570 | 4.6% |
| d | 141384 | 4.3% |
| n | 129971 | 4.0% |
| s | 126390 | 3.8% |
| Other values (85) | 946730 |
None
| Value | Count | Frequency (%) |
| ã | 17772 | |
| é | 10851 | |
| á | 8676 | |
| ç | 7132 | |
| ó | 5980 | 9.9% |
| ê | 1847 | 3.1% |
| í | 1671 | 2.8% |
| Ó | 1490 | 2.5% |
| õ | 901 | 1.5% |
| ú | 850 | 1.4% |
| Other values (73) | 3245 | 5.4% |
Emoticons
| Value | Count | Frequency (%) |
| 😍 | 71 | |
| 😆 | 19 | 8.1% |
| 😉 | 19 | 8.1% |
| 😡 | 18 | 7.7% |
| 😘 | 16 | 6.8% |
| 😁 | 13 | 5.5% |
| 😊 | 12 | 5.1% |
| 😀 | 8 | 3.4% |
| 😩 | 7 | 3.0% |
| 😃 | 6 | 2.6% |
| Other values (20) | 46 |
review_creation_date
Categorical
| Distinct | 633 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| 2017-12-19 00:00:00 | 516 |
|---|---|
| 2018-05-15 00:00:00 | 507 |
| 2018-05-19 00:00:00 | 497 |
| 2018-08-28 00:00:00 | 495 |
| 2017-12-20 00:00:00 | 489 |
| Other values (628) | 109868 |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 2135068 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 27 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2017-05-26 00:00:00 |
|---|---|
| 2nd row | 2017-11-29 00:00:00 |
| 3rd row | 2017-11-29 00:00:00 |
| 4th row | 2017-05-27 00:00:00 |
| 5th row | 2018-03-01 00:00:00 |
Common Values
| Value | Count | Frequency (%) |
| 2017-12-19 00:00:00 | 516 | 0.5% |
| 2018-05-15 00:00:00 | 507 | 0.5% |
| 2018-05-19 00:00:00 | 497 | 0.4% |
| 2018-08-28 00:00:00 | 495 | 0.4% |
| 2017-12-20 00:00:00 | 489 | 0.4% |
| 2018-05-22 00:00:00 | 487 | 0.4% |
| 2018-03-29 00:00:00 | 486 | 0.4% |
| 2018-08-14 00:00:00 | 472 | 0.4% |
| 2018-04-12 00:00:00 | 467 | 0.4% |
| 2018-05-04 00:00:00 | 467 | 0.4% |
| Other values (623) | 107489 | 95.7% |
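The column is profiled as Categorical because the timestamps are stored as 19-character strings. A minimal sketch of parsing them with the standard library, assuming every value follows the `YYYY-MM-DD HH:MM:SS` layout seen in the sample rows; the five strings below are copied from the Sample table and the variable names are illustrative.

```python
from collections import Counter
from datetime import datetime, time

# Layout assumed from the sample rows of review_creation_date.
FORMAT = "%Y-%m-%d %H:%M:%S"

samples = [
    "2017-05-26 00:00:00",
    "2017-11-29 00:00:00",
    "2017-11-29 00:00:00",
    "2017-05-27 00:00:00",
    "2018-03-01 00:00:00",
]

parsed = [datetime.strptime(s, FORMAT) for s in samples]

# Every sampled value is at midnight, so the column is effectively a date.
all_midnight = all(ts.time() == time(0, 0) for ts in parsed)

# Reviews per calendar month, a view that string profiling cannot give.
per_month = Counter(ts.strftime("%Y-%m") for ts in parsed)

print(all_midnight, dict(per_month))
```

Once parsed, the column can be profiled as a date, which gives a minimum, maximum, and time histogram instead of character counts.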
[Figure: Length histogram]
Words
| Value | Count | Frequency (%) |
| 00:00:00 | 112281 | |
| 2017-12-19 | 516 | 0.2% |
| 2018-05-15 | 507 | 0.2% |
| 2018-05-19 | 497 | 0.2% |
| 2018-08-28 | 495 | 0.2% |
| 2017-12-20 | 489 | 0.2% |
| 2018-05-22 | 487 | 0.2% |
| 2018-03-29 | 486 | 0.2% |
| 2018-08-14 | 472 | 0.2% |
| 2018-04-12 | 467 | 0.2% |
| Other values (625) | 108047 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 926201 | |
| - | 224744 | 10.5% |
| : | 224744 | 10.5% |
| 1 | 195042 | 9.1% |
| 2 | 179503 | 8.4% |
| (space) | 112372 | 5.3% |
| 8 | 90400 | 4.2% |
| 7 | 70105 | 3.3% |
| 3 | 27600 | 1.3% |
| 5 | 23953 | 1.1% |
| Other values (3) | 60404 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1573208 | |
| Dash Punctuation | 224744 | 10.5% |
| Other Punctuation | 224744 | 10.5% |
| Space Separator | 112372 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 926201 | |
| 1 | 195042 | 12.4% |
| 2 | 179503 | 11.4% |
| 8 | 90400 | 5.7% |
| 7 | 70105 | 4.5% |
| 3 | 27600 | 1.8% |
| 5 | 23953 | 1.5% |
| 4 | 22663 | 1.4% |
| 6 | 22279 | 1.4% |
| 9 | 15462 | 1.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 224744 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 224744 |
Space Separator
| Value | Count | Frequency (%) |
| 112372 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2135068 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 926201 | |
| - | 224744 | 10.5% |
| : | 224744 | 10.5% |
| 1 | 195042 | 9.1% |
| 2 | 179503 | 8.4% |
| (space) | 112372 | 5.3% |
| 8 | 90400 | 4.2% |
| 7 | 70105 | 3.3% |
| 3 | 27600 | 1.3% |
| 5 | 23953 | 1.1% |
| Other values (3) | 60404 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2135068 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 926201 | |
| - | 224744 | 10.5% |
| : | 224744 | 10.5% |
| 1 | 195042 | 9.1% |
| 2 | 179503 | 8.4% |
| (space) | 112372 | 5.3% |
| 8 | 90400 | 4.2% |
| 7 | 70105 | 3.3% |
| 3 | 27600 | 1.3% |
| 5 | 23953 | 1.1% |
| Other values (3) | 60404 | 2.8% |
review_answer_timestamp
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 97547 |
|---|---|
| Distinct (%) | 86.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| 2017-07-30 14:19:07 | 21 |
|---|---|
| 2018-03-12 12:46:07 | 20 |
| 2017-12-19 14:14:16 | 15 |
| 2017-02-16 17:14:41 | 15 |
| 2018-03-03 00:44:54 | 14 |
| Other values (97542) | 112287 |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 2135068 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 87111 |
|---|---|
| Unique (%) | 77.5% |
Sample
| 1st row | 2017-05-30 22:34:40 |
|---|---|
| 2nd row | 2017-12-01 19:41:59 |
| 3rd row | 2017-12-01 19:41:59 |
| 4th row | 2017-05-28 02:43:16 |
| 5th row | 2018-03-02 11:11:24 |
Common Values
| Value | Count | Frequency (%) |
| 2017-07-30 14:19:07 | 21 | < 0.1% |
| 2018-03-12 12:46:07 | 20 | < 0.1% |
| 2017-12-19 14:14:16 | 15 | < 0.1% |
| 2017-02-16 17:14:41 | 15 | < 0.1% |
| 2018-03-03 00:44:54 | 14 | < 0.1% |
| 2017-12-31 12:08:24 | 14 | < 0.1% |
| 2018-04-26 20:37:35 | 13 | < 0.1% |
| 2017-10-24 08:44:02 | 13 | < 0.1% |
| 2018-06-20 12:16:28 | 12 | < 0.1% |
| 2017-10-19 22:55:25 | 12 | < 0.1% |
| Other values (97537) | 112223 | 99.9% |
[Figure: Length histogram]
Words
| Value | Count | Frequency (%) |
| 2018-05-20 | 761 | 0.3% |
| 2018-05-21 | 673 | 0.3% |
| 2018-05-10 | 565 | 0.3% |
| 2017-12-20 | 438 | 0.2% |
| 2018-04-13 | 428 | 0.2% |
| 2017-12-13 | 415 | 0.2% |
| 2018-05-11 | 406 | 0.2% |
| 2018-08-24 | 396 | 0.2% |
| 2017-12-21 | 394 | 0.2% |
| 2018-08-31 | 390 | 0.2% |
| Other values (53772) | 219878 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 359552 | |
| 1 | 333159 | |
| 2 | 284979 | |
| - | 224744 | |
| : | 224744 | |
| 8 | 118619 | 5.6% |
| (space) | 112372 | 5.3% |
| 3 | 104570 | 4.9% |
| 7 | 96566 | 4.5% |
| 5 | 88993 | 4.2% |
| Other values (3) | 186770 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1573208 | |
| Dash Punctuation | 224744 | 10.5% |
| Other Punctuation | 224744 | 10.5% |
| Space Separator | 112372 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 359552 | |
| 1 | 333159 | |
| 2 | 284979 | |
| 8 | 118619 | 7.5% |
| 3 | 104570 | 6.6% |
| 7 | 96566 | 6.1% |
| 5 | 88993 | 5.7% |
| 4 | 88786 | 5.6% |
| 6 | 50704 | 3.2% |
| 9 | 47280 | 3.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 224744 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 224744 |
Space Separator
| Value | Count | Frequency (%) |
| 112372 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2135068 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 359552 | |
| 1 | 333159 | |
| 2 | 284979 | |
| - | 224744 | |
| : | 224744 | |
| 8 | 118619 | 5.6% |
| (space) | 112372 | 5.3% |
| 3 | 104570 | 4.9% |
| 7 | 96566 | 4.5% |
| 5 | 88993 | 4.2% |
| Other values (3) | 186770 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2135068 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 359552 | |
| 1 | 333159 | |
| 2 | 284979 | |
| - | 224744 | |
| : | 224744 | |
| 8 | 118619 | 5.6% |
| (space) | 112372 | 5.3% |
| 3 | 104570 | 4.9% |
| 7 | 96566 | 4.5% |
| 5 | 88993 | 4.2% |
| Other values (3) | 186770 |
product_category_name
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 73 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1598 |
| Missing (%) | 1.4% |
| Memory size | 1.7 MiB |
| cama_mesa_banho | 11137 |
|---|---|
| beleza_saude | 9645 |
| esporte_lazer | 8640 |
| moveis_decoracao | 8331 |
| informatica_acessorios | 7849 |
| Other values (68) | 65172 |
Length
| Max length | 46 |
|---|---|
| Median length | 32 |
| Mean length | 14.867415 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1646923 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | moveis_escritorio |
|---|---|
| 2nd row | moveis_escritorio |
| 3rd row | moveis_escritorio |
| 4th row | moveis_escritorio |
| 5th row | moveis_escritorio |
Common Values
| Value | Count | Frequency (%) |
| cama_mesa_banho | 11137 | 9.9% |
| beleza_saude | 9645 | 8.6% |
| esporte_lazer | 8640 | 7.7% |
| moveis_decoracao | 8331 | 7.4% |
| informatica_acessorios | 7849 | 7.0% |
| utilidades_domesticas | 6943 | 6.2% |
| relogios_presentes | 5950 | 5.3% |
| telefonia | 4517 | 4.0% |
| ferramentas_jardim | 4329 | 3.9% |
| automotivo | 4213 | 3.7% |
| Other values (63) | 39220 | 34.9% |
[Figure: Length histogram]
Words
| Value | Count | Frequency (%) |
| cama_mesa_banho | 11137 | 10.1% |
| beleza_saude | 9645 | 8.7% |
| esporte_lazer | 8640 | 7.8% |
| moveis_decoracao | 8331 | 7.5% |
| informatica_acessorios | 7849 | 7.1% |
| utilidades_domesticas | 6943 | 6.3% |
| relogios_presentes | 5950 | 5.4% |
| telefonia | 4517 | 4.1% |
| ferramentas_jardim | 4329 | 3.9% |
| automotivo | 4213 | 3.8% |
| Other values (63) | 39220 | 35.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 200532 | |
| a | 197630 | |
| s | 164170 | |
| o | 163030 | |
| i | 109507 | 6.6% |
| r | 106270 | 6.5% |
| _ | 104446 | 6.3% |
| t | 79354 | 4.8% |
| c | 78054 | 4.7% |
| m | 74127 | 4.5% |
| Other values (18) | 369803 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1542212 | |
| Connector Punctuation | 104446 | 6.3% |
| Decimal Number | 265 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 200532 | |
| a | 197630 | |
| s | 164170 | |
| o | 163030 | |
| i | 109507 | 7.1% |
| r | 106270 | 6.9% |
| t | 79354 | 5.1% |
| c | 78054 | 5.1% |
| m | 74127 | 4.8% |
| n | 56168 | 3.6% |
| Other values (16) | 313370 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 104446 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 265 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1542212 | |
| Common | 104711 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 200532 | |
| a | 197630 | |
| s | 164170 | |
| o | 163030 | |
| i | 109507 | 7.1% |
| r | 106270 | 6.9% |
| t | 79354 | 5.1% |
| c | 78054 | 5.1% |
| m | 74127 | 4.8% |
| n | 56168 | 3.6% |
| Other values (16) | 313370 |
Common
| Value | Count | Frequency (%) |
| _ | 104446 | |
| 2 | 265 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1646923 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 200532 | |
| a | 197630 | |
| s | 164170 | |
| o | 163030 | |
| i | 109507 | 6.6% |
| r | 106270 | 6.5% |
| _ | 104446 | 6.3% |
| t | 79354 | 4.8% |
| c | 78054 | 4.7% |
| m | 74127 | 4.5% |
| Other values (18) | 369803 |
product_name_lenght
Real number (ℝ)
| Distinct | 66 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1598 |
| Missing (%) | 1.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48.777583 |
| Minimum | 5 |
|---|---|
| Maximum | 76 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 29 |
| Q1 | 42 |
| median | 52 |
| Q3 | 57 |
| 95-th percentile | 60 |
| Maximum | 76 |
| Range | 71 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 10.025179 |
|---|---|
| Coefficient of variation (CV) | 0.20552841 |
| Kurtosis | 0.15452858 |
| Mean | 48.777583 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.90730409 |
| Sum | 5403288 |
| Variance | 100.50422 |
| Monotonicity | Not monotonic |
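Several of the statistics above are derived from the others, which gives a quick consistency check on the report. The sketch below recomputes them from the values printed for product_name_lenght; the variable names are illustrative.

```python
# Values copied from the product_name_lenght tables above.
mean = 48.777583
std = 10.025179
q1, q3 = 42, 57
minimum, maximum = 5, 76

# Coefficient of variation: spread relative to the mean.
cv = std / mean

# Variance is the square of the standard deviation.
variance = std ** 2

# Interquartile range and full range come from the quantile table.
iqr = q3 - q1
value_range = maximum - minimum

print(f"CV={cv:.8f} variance={variance:.5f} IQR={iqr} range={value_range}")
```

Small differences in the last digit are expected because the report rounds its inputs.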
| Value | Count | Frequency (%) |
| 59 | 8302 | 7.4% |
| 60 | 7692 | 6.8% |
| 56 | 6512 | 5.8% |
| 58 | 6418 | 5.7% |
| 57 | 5985 | 5.3% |
| 55 | 5539 | 4.9% |
| 54 | 5245 | 4.7% |
| 53 | 4160 | 3.7% |
| 52 | 4144 | 3.7% |
| 49 | 3561 | 3.2% |
| Other values (56) | 53216 |
| Value | Count | Frequency (%) |
| 5 | 9 | < 0.1% |
| 6 | 3 | < 0.1% |
| 7 | 2 | < 0.1% |
| 8 | 4 | < 0.1% |
| 9 | 13 | < 0.1% |
| 10 | 4 | < 0.1% |
| 11 | 10 | < 0.1% |
| 12 | 37 | |
| 13 | 26 | |
| 14 | 45 |
| Value | Count | Frequency (%) |
| 76 | 1 | < 0.1% |
| 72 | 9 | < 0.1% |
| 69 | 1 | < 0.1% |
| 68 | 1 | < 0.1% |
| 67 | 3 | < 0.1% |
| 66 | 1 | < 0.1% |
| 64 | 163 | 0.1% |
| 63 | 1253 | |
| 62 | 153 | 0.1% |
| 61 | 233 | 0.2% |
product_description_lenght
Real number (ℝ)
| Distinct | 2958 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 1598 |
| Missing (%) | 1.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 786.79393 |
| Minimum | 4 |
|---|---|
| Maximum | 3992 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 161 |
| Q1 | 348 |
| median | 601 |
| Q3 | 985 |
| 95-th percentile | 2120 |
| Maximum | 3992 |
| Range | 3988 |
| Interquartile range (IQR) | 637 |
Descriptive statistics
| Standard deviation | 651.6095 |
|---|---|
| Coefficient of variation (CV) | 0.82818318 |
| Kurtosis | 4.9090181 |
| Mean | 786.79393 |
| Median Absolute Deviation (MAD) | 295 |
| Skewness | 2.0066937 |
| Sum | 87156311 |
| Variance | 424594.94 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 341 | 688 | 0.6% |
| 1893 | 633 | 0.6% |
| 348 | 619 | 0.6% |
| 903 | 577 | 0.5% |
| 492 | 577 | 0.5% |
| 245 | 541 | 0.5% |
| 366 | 521 | 0.5% |
| 236 | 486 | 0.4% |
| 340 | 465 | 0.4% |
| 919 | 421 | 0.4% |
| Other values (2948) | 105246 | 93.7% |
| (Missing) | 1598 | 1.4% |
| Value | Count | Frequency (%) |
| 4 | 6 | |
| 8 | 2 | < 0.1% |
| 15 | 1 | < 0.1% |
| 20 | 6 | |
| 23 | 1 | < 0.1% |
| 26 | 2 | < 0.1% |
| 27 | 3 | < 0.1% |
| 28 | 2 | < 0.1% |
| 30 | 8 | |
| 31 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 3992 | 2 | < 0.1% |
| 3988 | 1 | < 0.1% |
| 3985 | 3 | |
| 3976 | 5 | |
| 3963 | 1 | < 0.1% |
| 3956 | 3 | |
| 3954 | 2 | < 0.1% |
| 3950 | 2 | < 0.1% |
| 3949 | 1 | < 0.1% |
| 3948 | 1 | < 0.1% |
product_photos_qty
Real number (ℝ)
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1598 |
| Missing (%) | 1.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.2071244 |
| Minimum | 1 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.7197871 |
|---|---|
| Coefficient of variation (CV) | 0.7791981 |
| Kurtosis | 4.865013 |
| Mean | 2.2071244 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.9127487 |
| Sum | 244492 |
| Variance | 2.9576678 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 55943 | 49.8% |
| 2 | 21947 | 19.5% |
| 3 | 12339 | 11.0% |
| 4 | 8383 | 7.5% |
| 5 | 5344 | 4.8% |
| 6 | 3758 | 3.3% |
| 7 | 1498 | 1.3% |
| 8 | 726 | 0.6% |
| 10 | 342 | 0.3% |
| 9 | 304 | 0.3% |
| Other values (9) | 190 | 0.2% |
| (Missing) | 1598 | 1.4% |
| Value | Count | Frequency (%) |
| 1 | 55943 | 49.8% |
| 2 | 21947 | 19.5% |
| 3 | 12339 | 11.0% |
| 4 | 8383 | 7.5% |
| 5 | 5344 | 4.8% |
| 6 | 3758 | 3.3% |
| 7 | 1498 | 1.3% |
| 8 | 726 | 0.6% |
| 9 | 304 | 0.3% |
| 10 | 342 | 0.3% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 19 | 2 | < 0.1% |
| 18 | 4 | < 0.1% |
| 17 | 11 | < 0.1% |
| 15 | 12 | < 0.1% |
| 14 | 6 | < 0.1% |
| 13 | 30 | < 0.1% |
| 12 | 53 | < 0.1% |
| 11 | 71 | 0.1% |
| 10 | 342 |
product_weight_g
Real number (ℝ)
| Distinct | 2200 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2090.6109 |
| Minimum | 0 |
|---|---|
| Maximum | 40425 |
| Zeros | 8 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 125 |
| Q1 | 300 |
| median | 700 |
| Q3 | 1800 |
| 95-th percentile | 9750 |
| Maximum | 40425 |
| Range | 40425 |
| Interquartile range (IQR) | 1500 |
Descriptive statistics
| Standard deviation | 3748.6081 |
|---|---|
| Coefficient of variation (CV) | 1.7930683 |
| Kurtosis | 16.263327 |
| Mean | 2090.6109 |
| Median Absolute Deviation (MAD) | 500 |
| Skewness | 3.5994489 |
| Sum | 2.348885 × 10^8 |
| Variance | 14052063 |
| Monotonicity | Not monotonic |
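With a skewness of about 3.6 and a maximum of 40425 g against a median of 700 g, product_weight_g has a long right tail. One common way to quantify that is Tukey's 1.5 × IQR fences; this is a standard rule of thumb applied here as an illustration, not something the report itself computes, and `tukey_fences` is a hypothetical helper.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) outlier fences based on the interquartile range."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles copied from the product_weight_g quantile table.
q1, q3 = 300, 1800
p95 = 9750

lower, upper = tukey_fences(q1, q3)

# The 95th percentile already exceeds the upper fence, so more than 5%
# of products would be flagged by this rule.
p95_is_beyond_fence = p95 > upper

print(lower, upper, p95_is_beyond_fence)
```

The lower fence is negative, so only the upper fence is informative for a weight column; the 8 zero-weight rows would need a separate check.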
| Value | Count | Frequency (%) |
| 200 | 6757 | 6.0% |
| 150 | 5250 | 4.7% |
| 250 | 4530 | 4.0% |
| 300 | 4237 | 3.8% |
| 400 | 3629 | 3.2% |
| 100 | 3511 | 3.1% |
| 350 | 3167 | 2.8% |
| 500 | 2693 | 2.4% |
| 600 | 2689 | 2.4% |
| 700 | 2035 | 1.8% |
| Other values (2190) | 73856 |
| Value | Count | Frequency (%) |
| 0 | 8 | < 0.1% |
| 2 | 5 | < 0.1% |
| 25 | 3 | < 0.1% |
| 50 | 948 | |
| 53 | 2 | < 0.1% |
| 54 | 1 | < 0.1% |
| 55 | 2 | < 0.1% |
| 58 | 1 | < 0.1% |
| 60 | 9 | < 0.1% |
| 61 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 40425 | 3 | < 0.1% |
| 30000 | 278 | |
| 29800 | 1 | < 0.1% |
| 29750 | 1 | < 0.1% |
| 29700 | 4 | < 0.1% |
| 29600 | 5 | < 0.1% |
| 29500 | 2 | < 0.1% |
| 29250 | 1 | < 0.1% |
| 29150 | 1 | < 0.1% |
| 29100 | 1 | < 0.1% |
product_length_cm
Real number (ℝ)
| Distinct | 99 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.152198 |
| Minimum | 7 |
|---|---|
| Maximum | 105 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 18 |
| median | 25 |
| Q3 | 38 |
| 95-th percentile | 62 |
| Maximum | 105 |
| Range | 98 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 16.139323 |
|---|---|
| Coefficient of variation (CV) | 0.53526191 |
| Kurtosis | 3.7356739 |
| Mean | 30.152198 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 1.7587665 |
| Sum | 3387720 |
| Variance | 260.47774 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 17523 | 15.6% |
| 20 | 10522 | 9.4% |
| 30 | 7541 | 6.7% |
| 17 | 5946 | 5.3% |
| 18 | 5720 | 5.1% |
| 25 | 4675 | 4.2% |
| 19 | 4662 | 4.1% |
| 40 | 4098 | 3.6% |
| 22 | 3831 | 3.4% |
| 50 | 2951 | 2.6% |
| Other values (89) | 44885 |
| Value | Count | Frequency (%) |
| 7 | 32 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 4 | < 0.1% |
| 10 | 8 | < 0.1% |
| 11 | 95 | 0.1% |
| 12 | 40 | < 0.1% |
| 13 | 60 | 0.1% |
| 14 | 134 | 0.1% |
| 15 | 201 | 0.2% |
| 16 | 17523 |
| Value | Count | Frequency (%) |
| 105 | 326 | |
| 104 | 28 | < 0.1% |
| 103 | 45 | < 0.1% |
| 102 | 59 | 0.1% |
| 101 | 107 | 0.1% |
| 100 | 379 | |
| 99 | 35 | < 0.1% |
| 98 | 47 | < 0.1% |
| 97 | 11 | < 0.1% |
| 96 | 8 | < 0.1% |
product_height_cm
Real number (ℝ)
| Distinct | 102 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.576811 |
| Minimum | 2 |
|---|---|
| Maximum | 105 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 8 |
| median | 13 |
| Q3 | 20 |
| 95-th percentile | 45 |
| Maximum | 105 |
| Range | 103 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 13.437792 |
|---|---|
| Coefficient of variation (CV) | 0.81063796 |
| Kurtosis | 7.3883945 |
| Mean | 16.576811 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 2.2558193 |
| Sum | 1862471 |
| Variance | 180.57426 |
| Monotonicity | Not monotonic |
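
Several of the product_height_cm figures above are derived from others, so they can be cross-checked with simple arithmetic. The sketch below copies the reported values by hand and recomputes range, IQR, coefficient of variation, variance, and the non-missing count. The names `reported`, `derive`, and `N_OBSERVATIONS` are illustrative and not part of the report; the total of 112372 rows is taken from the dataset overview.

```python
# Values copied by hand from the product_height_cm tables in this report.
reported = {
    "min": 2, "max": 105, "range": 103,
    "q1": 8, "q3": 20, "iqr": 12,
    "mean": 16.576811, "std": 13.437792,
    "cv": 0.81063796, "variance": 180.57426,
    "sum": 1862471, "missing": 18,
}
N_OBSERVATIONS = 112372  # "Number of observations" in the dataset overview


def derive(stats):
    """Recompute the derived statistics from the primary ones."""
    return {
        "range": stats["max"] - stats["min"],
        "iqr": stats["q3"] - stats["q1"],
        "cv": stats["std"] / stats["mean"],
        "variance": stats["std"] ** 2,
        # Sum divided by mean gives the number of non-missing values.
        "non_missing": round(stats["sum"] / stats["mean"]),
    }


derived = derive(reported)
for name, value in derived.items():
    print(f"{name}: {value}")
```

If the recomputed values drift from the reported ones by more than rounding error, that would suggest the tables were copied or extracted incorrectly.
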
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 9820 | 8.7% |
| 15 | 6563 | 5.8% |
| 20 | 6533 | 5.8% |
| 12 | 6245 | 5.6% |
| 11 | 6132 | 5.5% |
| 2 | 4996 | 4.4% |
| 8 | 4670 | 4.2% |
| 4 | 4659 | 4.1% |
| 5 | 4558 | 4.1% |
| 16 | 4549 | 4.0% |
| Other values (92) | 53629 | 47.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 4996 | 4.4% |
| 3 | 2701 | 2.4% |
| 4 | 4659 | 4.1% |
| 5 | 4558 | 4.1% |
| 6 | 3394 | 3.0% |
| 7 | 4184 | 3.7% |
| 8 | 4670 | 4.2% |
| 9 | 3219 | 2.9% |
| 10 | 9820 | 8.7% |
| 11 | 6132 | 5.5% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 105 | 133 | 0.1% |
| 104 | 12 | < 0.1% |
| 103 | 49 | < 0.1% |
| 102 | 10 | < 0.1% |
| 100 | 41 | < 0.1% |
| 99 | 5 | < 0.1% |
| 98 | 3 | < 0.1% |
| 97 | 2 | < 0.1% |
| 96 | 8 | < 0.1% |
| 95 | 22 | < 0.1% |
product_width_cm
Real number (ℝ)
| Distinct | 95 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.00121 |
| Minimum | 6 |
|---|---|
| Maximum | 118 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 15 |
| median | 20 |
| Q3 | 30 |
| 95-th percentile | 45 |
| Maximum | 118 |
| Range | 112 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 11.707552 |
|---|---|
| Coefficient of variation (CV) | 0.50899722 |
| Kurtosis | 4.6498388 |
| Mean | 23.00121 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.7221613 |
| Sum | 2584278 |
| Variance | 137.06678 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 20 | 12047 | 10.7% |
| 11 | 10632 | 9.5% |
| 15 | 8935 | 8.0% |
| 16 | 8428 | 7.5% |
| 30 | 7601 | 6.8% |
| 12 | 5454 | 4.9% |
| 13 | 5254 | 4.7% |
| 14 | 4594 | 4.1% |
| 18 | 4065 | 3.6% |
| 40 | 3877 | 3.5% |
| Other values (85) | 41467 | 36.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 2 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 28 | < 0.1% |
| 9 | 50 | < 0.1% |
| 10 | 82 | 0.1% |
| 11 | 10632 | 9.5% |
| 12 | 5454 | 4.9% |
| 13 | 5254 | 4.7% |
| 14 | 4594 | 4.1% |
| 15 | 8935 | 8.0% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 118 | 7 | < 0.1% |
| 105 | 14 | < 0.1% |
| 104 | 1 | < 0.1% |
| 103 | 1 | < 0.1% |
| 102 | 2 | < 0.1% |
| 101 | 2 | < 0.1% |
| 100 | 42 | < 0.1% |
| 98 | 1 | < 0.1% |
| 97 | 1 | < 0.1% |
| 95 | 2 | < 0.1% |
Correlations
| | customer_zip_code_prefix | order_item_id | price | freight_value | product_name_lenght | product_description_lenght | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm | customer_state | order_status | review_score | product_category_name |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| customer_zip_code_prefix | 1.000 | -0.009 | 0.071 | 0.469 | 0.016 | 0.029 | 0.027 | 0.028 | 0.013 | 0.019 | 0.001 | 0.896 | 0.023 | 0.042 | 0.051 |
| order_item_id | -0.009 | 1.000 | -0.116 | -0.055 | -0.020 | -0.032 | -0.066 | 0.001 | 0.007 | 0.018 | -0.004 | 0.000 | 0.004 | 0.042 | 0.029 |
| price | 0.071 | -0.116 | 1.000 | 0.434 | 0.040 | 0.209 | 0.028 | 0.514 | 0.266 | 0.326 | 0.271 | 0.019 | 0.013 | 0.011 | 0.113 |
| freight_value | 0.469 | -0.055 | 0.434 | 1.000 | 0.034 | 0.117 | 0.010 | 0.446 | 0.283 | 0.282 | 0.274 | 0.085 | 0.014 | 0.013 | 0.093 |
| product_name_lenght | 0.016 | -0.020 | 0.040 | 0.034 | 1.000 | 0.074 | 0.163 | 0.077 | 0.062 | -0.055 | 0.068 | 0.012 | 0.019 | 0.013 | 0.132 |
| product_description_lenght | 0.029 | -0.032 | 0.209 | 0.117 | 0.074 | 1.000 | 0.111 | 0.095 | -0.019 | 0.132 | -0.079 | 0.022 | 0.008 | 0.013 | 0.203 |
| product_photos_qty | 0.027 | -0.066 | 0.028 | 0.010 | 0.163 | 0.111 | 1.000 | 0.005 | 0.007 | -0.081 | -0.013 | 0.014 | 0.013 | 0.015 | 0.151 |
| product_weight_g | 0.028 | 0.001 | 0.514 | 0.446 | 0.077 | 0.095 | 0.005 | 1.000 | 0.618 | 0.531 | 0.620 | 0.015 | 0.006 | 0.020 | 0.199 |
| product_length_cm | 0.013 | 0.007 | 0.266 | 0.283 | 0.062 | -0.019 | 0.007 | 0.618 | 1.000 | 0.248 | 0.631 | 0.013 | 0.009 | 0.018 | 0.259 |
| product_height_cm | 0.019 | 0.018 | 0.326 | 0.282 | -0.055 | 0.132 | -0.081 | 0.531 | 0.248 | 1.000 | 0.340 | 0.016 | 0.014 | 0.018 | 0.278 |
| product_width_cm | 0.001 | -0.004 | 0.271 | 0.274 | 0.068 | -0.079 | -0.013 | 0.620 | 0.631 | 0.340 | 1.000 | 0.012 | 0.003 | 0.013 | 0.295 |
| customer_state | 0.896 | 0.000 | 0.019 | 0.085 | 0.012 | 0.022 | 0.014 | 0.015 | 0.013 | 0.016 | 0.012 | 1.000 | 0.025 | 0.049 | 0.034 |
| order_status | 0.023 | 0.004 | 0.013 | 0.014 | 0.019 | 0.008 | 0.013 | 0.006 | 0.009 | 0.014 | 0.003 | 0.025 | 1.000 | 0.132 | 0.029 |
| review_score | 0.042 | 0.042 | 0.011 | 0.013 | 0.013 | 0.013 | 0.015 | 0.020 | 0.018 | 0.018 | 0.013 | 0.049 | 0.132 | 1.000 | 0.054 |
| product_category_name | 0.051 | 0.029 | 0.113 | 0.093 | 0.132 | 0.203 | 0.151 | 0.199 | 0.259 | 0.278 | 0.295 | 0.034 | 0.029 | 0.054 | 1.000 |
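
The matrix above is easier to read once the strongest pairs are pulled out. The sketch below copies a handful of coefficients from the table by hand and lists the pairs whose absolute value reaches a cut-off. The 0.5 threshold and the names `corr`, `THRESHOLD`, and `strong_pairs` are assumptions made for illustration; this section does not state which cut-off or which correlation measure the report used.

```python
# Coefficients copied by hand from the correlation table (one entry per pair).
corr = {
    ("customer_zip_code_prefix", "customer_state"): 0.896,
    ("customer_zip_code_prefix", "freight_value"): 0.469,
    ("price", "freight_value"): 0.434,
    ("price", "product_weight_g"): 0.514,
    ("freight_value", "product_weight_g"): 0.446,
    ("product_weight_g", "product_length_cm"): 0.618,
    ("product_weight_g", "product_height_cm"): 0.531,
    ("product_weight_g", "product_width_cm"): 0.620,
    ("product_length_cm", "product_height_cm"): 0.248,
    ("product_length_cm", "product_width_cm"): 0.631,
    ("product_height_cm", "product_width_cm"): 0.340,
}
THRESHOLD = 0.5  # assumed cut-off, chosen only for this illustration


def strong_pairs(matrix, threshold):
    """Return (a, b, r) tuples with |r| >= threshold, strongest first."""
    pairs = [(a, b, r) for (a, b), r in matrix.items() if abs(r) >= threshold]
    return sorted(pairs, key=lambda p: abs(p[2]), reverse=True)


for a, b, r in strong_pairs(corr, THRESHOLD):
    print(f"{a} ~ {b}: {r:.3f}")
```

Under that assumed cut-off, the selection includes the zip-code/state and price/weight pairs as well as several pairs among the product dimensions.
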
First rows
| | customer_id | customer_unique_id | customer_zip_code_prefix | customer_city | customer_state | order_id | order_status | order_purchase_timestamp | order_approved_at | order_delivered_carrier_date | order_delivered_customer_date | order_estimated_delivery_date | order_item_id | product_id | seller_id | shipping_limit_date | price | freight_value | review_id | review_score | review_comment_title | review_comment_message | review_creation_date | review_answer_timestamp | product_category_name | product_name_lenght | product_description_lenght | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 06b8999e2fba1a1fbc88172c00ba8bc7 | 861eff4711a542e4b93843c6dd7febb0 | 14409 | franca | SP | 00e7ee1b050b8499577073aeb2a297a1 | delivered | 2017-05-16 15:05:35 | 2017-05-16 15:22:12 | 2017-05-23 10:47:57 | 2017-05-25 10:35:35 | 2017-06-05 00:00:00 | 1 | a9516a079e37a9c9c36b9b78b10169e8 | 7c67e1448b00f6e969d365cea6b010ab | 2017-05-22 15:22:12 | 124.99 | 21.88 | 88b8b52d46df026a9d1ad2136a59b30b | 4 | NaN | NaN | 2017-05-26 00:00:00 | 2017-05-30 22:34:40 | moveis_escritorio | 41.0 | 1141.0 | 1.0 | 8683.0 | 54.0 | 64.0 | 31.0 |
| 1 | 8912fc0c3bbf1e2fbf35819e21706718 | 9eae34bbd3a474ec5d07949ca7de67c0 | 68030 | santarem | PA | c1d2b34febe9cd269e378117d6681172 | delivered | 2017-11-09 00:50:13 | 2017-11-10 00:47:48 | 2017-11-22 01:43:37 | 2017-11-28 00:09:50 | 2017-12-19 00:00:00 | 1 | a9516a079e37a9c9c36b9b78b10169e8 | 7c67e1448b00f6e969d365cea6b010ab | 2017-11-23 00:47:18 | 112.99 | 24.90 | 7fc63200f12eebb5f387856afdd63db8 | 1 | NaN | GOSTARIA DE UMA SOLUÇÃO, ESTOU PRECISANDO MUITO DO PRODUTO. | 2017-11-29 00:00:00 | 2017-12-01 19:41:59 | moveis_escritorio | 41.0 | 1141.0 | 1.0 | 8683.0 | 54.0 | 64.0 | 31.0 |
| 2 | 8912fc0c3bbf1e2fbf35819e21706718 | 9eae34bbd3a474ec5d07949ca7de67c0 | 68030 | santarem | PA | c1d2b34febe9cd269e378117d6681172 | delivered | 2017-11-09 00:50:13 | 2017-11-10 00:47:48 | 2017-11-22 01:43:37 | 2017-11-28 00:09:50 | 2017-12-19 00:00:00 | 2 | a9516a079e37a9c9c36b9b78b10169e8 | 7c67e1448b00f6e969d365cea6b010ab | 2017-11-23 00:47:18 | 112.99 | 24.90 | 7fc63200f12eebb5f387856afdd63db8 | 1 | NaN | GOSTARIA DE UMA SOLUÇÃO, ESTOU PRECISANDO MUITO DO PRODUTO. | 2017-11-29 00:00:00 | 2017-12-01 19:41:59 | moveis_escritorio | 41.0 | 1141.0 | 1.0 | 8683.0 | 54.0 | 64.0 | 31.0 |
| 3 | f0ac8e5a239118859b1734e1087cbb1f | 3c799d181c34d51f6d44bbbc563024db | 92480 | nova santa rita | RS | b1a5d5365d330d10485e0203d54ab9e8 | delivered | 2017-05-07 20:11:26 | 2017-05-08 22:22:56 | 2017-05-19 20:16:31 | 2017-05-26 09:54:04 | 2017-06-12 00:00:00 | 1 | a9516a079e37a9c9c36b9b78b10169e8 | 7c67e1448b00f6e969d365cea6b010ab | 2017-05-22 22:22:56 | 124.99 | 15.62 | 251191809e37c1cffc16865947c18a4d | 3 | NaN | Produto compatível com seu valor, muito bonito e barato, simples, mas um bom custo benefício. | 2017-05-27 00:00:00 | 2017-05-28 02:43:16 | moveis_escritorio | 41.0 | 1141.0 | 1.0 | 8683.0 | 54.0 | 64.0 | 31.0 |
| 4 | 6bc8d08963a135220ed6c6d098831f84 | 23397e992b09769faf5e66f9e171a241 | 25931 | mage | RJ | 2e604b3614664aa66867856dba7e61b7 | delivered | 2018-02-03 19:45:40 | 2018-02-04 22:29:19 | 2018-02-19 18:21:47 | 2018-02-28 21:09:00 | 2018-03-22 00:00:00 | 1 | a9516a079e37a9c9c36b9b78b10169e8 | 7c67e1448b00f6e969d365cea6b010ab | 2018-02-18 21:29:19 | 106.99 | 30.59 | f7123bac5b91a0e2e38d8b41fd1206f4 | 4 | NaN | Entregou antes do prazo | 2018-03-01 00:00:00 | 2018-03-02 11:11:24 | moveis_escritorio | 41.0 | 1141.0 | 1.0 | 8683.0 | 54.0 | 64.0 | 31.0 |
| 5 | fd3a0b1bd209f0e7d420c9c3d1127613 | 567ab47ca4deb92d46dbf54dce07d0a7 | 88460 | angelina | SC | 574fe1739f65af76badd0999db300b4f | delivered | 2017-03-23 15:10:17 | 2017-03-23 15:25:11 | 2017-03-28 18:23:51 | 2017-04-11 10:16:56 | 2017-04-24 00:00:00 | 1 | a9516a079e37a9c9c36b9b78b10169e8 | 7c67e1448b00f6e969d365cea6b010ab | 2017-04-05 15:25:11 | 126.99 | 15.06 | 1496c2a9c41a846ba946a98a09879660 | 4 | NaN | NaN | 2017-04-12 00:00:00 | 2017-04-15 15:34:45 | moveis_escritorio | 41.0 | 1141.0 | 1.0 | 8683.0 | 54.0 | 64.0 | 31.0 |
| 6 | fbd40c083aa8cddebb5265b2ba6aaf2e | f40ab89b622248b7ca125af4b486b887 | 32341 | contagem | MG | e0b26f14d2bcc710bb02f77a4628763b | delivered | 2017-05-16 10:00:49 | 2017-05-17 03:45:27 | 2017-05-23 10:35:48 | 2017-05-29 12:04:19 | 2017-06-07 00:00:00 | 1 | a9516a079e37a9c9c36b9b78b10169e8 | 7c67e1448b00f6e969d365cea6b010ab | 2017-05-23 03:45:27 | 124.99 | 30.71 | 23c661ff382c3a54fc8f98a0b627301f | 4 | NaN | Os encaixes para o encosto da cadeira estavam desalinhados. Deu trabalho pra encaixar e finalmente montar o produto. | 2017-05-30 00:00:00 | 2017-05-31 02:54:49 | moveis_escritorio | 41.0 | 1141.0 | 1.0 | 8683.0 | 54.0 | 64.0 | 31.0 |
| 7 | 10558ef4afea173bfb5e2cbe3d5b0bb5 | 749943913a9851a39c9baf51877fbab6 | 78134 | varzea grande | MT | eaae5bd20fb15d85aa673d9b7c0e8ca5 | delivered | 2017-03-18 23:04:36 | 2017-03-18 23:04:36 | 2017-03-28 06:29:47 | 2017-05-30 09:19:58 | 2017-04-27 00:00:00 | 1 | a9516a079e37a9c9c36b9b78b10169e8 | 7c67e1448b00f6e969d365cea6b010ab | 2017-03-29 23:04:36 | 126.99 | 21.34 | 3af0c84579dd7573c098da99a26dd1d7 | 2 | NaN | Bom dia! \r\nEstou insatisfeita com a segunda compra, pois era para ser entregue até 27/04/2017 e até o momento não a recebi. Já terminei de pagar e a mercadoria não me foi entregue. | 2017-04-29 00:00:00 | 2017-05-09 13:03:07 | moveis_escritorio | 41.0 | 1141.0 | 1.0 | 8683.0 | 54.0 | 64.0 | 31.0 |
| 8 | 1c37c0f0cd1d88d46d9fc9494762abbd | 432ecfa8b7b7ad2663c7abed0dc83c51 | 31270 | belo horizonte | MG | c4e2bd2043fbd75b325a47adfabf0d77 | delivered | 2018-03-15 07:45:10 | 2018-03-15 07:55:24 | 2018-03-27 15:58:41 | 2018-04-05 12:37:25 | 2018-04-19 00:00:00 | 1 | a9516a079e37a9c9c36b9b78b10169e8 | 7c67e1448b00f6e969d365cea6b010ab | 2018-03-29 07:55:24 | 116.99 | 33.08 | a82bdb93d19f3fd2dfc9c27140488965 | 3 | NaN | A entrega é muito demorada. | 2018-04-06 00:00:00 | 2018-04-06 16:02:36 | moveis_escritorio | 41.0 | 1141.0 | 1.0 | 8683.0 | 54.0 | 64.0 | 31.0 |
| 9 | 20a452f528d487411fd7d3ebda1d0f20 | 31318a0597cd9d50ce4cfd03c80fe780 | 37540 | santa rita do sapucai | MG | 1c7fe02ac4c7be50c59afb295cf85b89 | delivered | 2018-01-26 13:22:09 | 2018-01-30 03:47:31 | 2018-02-09 13:17:44 | 2018-02-20 14:18:51 | 2018-03-12 00:00:00 | 1 | a9516a079e37a9c9c36b9b78b10169e8 | 7c67e1448b00f6e969d365cea6b010ab | 2018-02-13 03:47:31 | 106.99 | 21.76 | 42b3cd1634f9613559f38187611136d3 | 2 | NaN | Uma cadeira veio faltando a parte do encosto. | 2018-02-21 00:00:00 | 2018-02-21 18:09:14 | moveis_escritorio | 41.0 | 1141.0 | 1.0 | 8683.0 | 54.0 | 64.0 | 31.0 |
Last rows
| | customer_id | customer_unique_id | customer_zip_code_prefix | customer_city | customer_state | order_id | order_status | order_purchase_timestamp | order_approved_at | order_delivered_carrier_date | order_delivered_customer_date | order_estimated_delivery_date | order_item_id | product_id | seller_id | shipping_limit_date | price | freight_value | review_id | review_score | review_comment_title | review_comment_message | review_creation_date | review_answer_timestamp | product_category_name | product_name_lenght | product_description_lenght | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 112362 | 0b7a30ba373aeb55cf28add5b5477956 | 8c8173e547e020f411aa55b2fceed861 | 87145 | paicandu | PR | e12f5458c6b4f349a97fbf22e08d17ac | delivered | 2017-08-01 12:57:02 | 2017-08-02 03:03:55 | 2017-08-02 17:47:19 | 2017-08-15 13:54:33 | 2017-08-23 00:00:00 | 1 | 494c9e9cadda96141fec59b218afc773 | 7142540dd4c91e2237acb7e911c4eba2 | 2017-08-08 03:03:55 | 59.90 | 17.67 | ea9662b9a368e2441a096b94f23e0794 | 5 | NaN | muito bom comprar com voces | 2017-08-16 00:00:00 | 2017-08-16 19:29:34 | automotivo | 57.0 | 651.0 | 5.0 | 16700.0 | 78.0 | 4.0 | 47.0 |
| 112363 | 0d7da1d5507a67dbb512b53744e775e9 | a5ba329297100ea689fa263768b35b8b | 95020 | caxias do sul | RS | 673242a6057d4287cb24379d405cf5ac | delivered | 2017-07-15 14:31:13 | 2017-07-15 14:43:29 | 2017-07-18 20:12:44 | 2017-07-28 18:32:36 | 2017-08-10 00:00:00 | 2 | 90ed53d34bfcb4fa1e90656068dd04bc | 1336efc61c316ddf92c899eb817f7cae | 2017-07-20 14:43:29 | 15.99 | 21.48 | 7c8b999b4f76c17ddbba8d4da8a6f37a | 3 | NaN | A garrafa de vidro ainda não chegou... mas os copos sim... vou aguardar | 2017-07-29 00:00:00 | 2017-08-02 12:21:42 | moveis_decoracao | 42.0 | 378.0 | 2.0 | 300.0 | 30.0 | 30.0 | 30.0 |
| 112364 | 0d7da1d5507a67dbb512b53744e775e9 | a5ba329297100ea689fa263768b35b8b | 95020 | caxias do sul | RS | 673242a6057d4287cb24379d405cf5ac | delivered | 2017-07-15 14:31:13 | 2017-07-15 14:43:29 | 2017-07-18 20:12:44 | 2017-07-28 18:32:36 | 2017-08-10 00:00:00 | 3 | 90ed53d34bfcb4fa1e90656068dd04bc | 1336efc61c316ddf92c899eb817f7cae | 2017-07-20 14:43:29 | 15.99 | 21.48 | 7c8b999b4f76c17ddbba8d4da8a6f37a | 3 | NaN | A garrafa de vidro ainda não chegou... mas os copos sim... vou aguardar | 2017-07-29 00:00:00 | 2017-08-02 12:21:42 | moveis_decoracao | 42.0 | 378.0 | 2.0 | 300.0 | 30.0 | 30.0 | 30.0 |
| 112365 | de10ff4100545d244951898150f9aad1 | 64aa56502b9ea3886fb356a259434201 | 73801 | formosa | GO | 925d5a139c3c9f4e3f028685b9a95617 | delivered | 2018-06-24 19:43:20 | 2018-06-24 20:16:03 | 2018-06-25 17:29:00 | 2018-07-03 02:24:49 | 2018-07-20 00:00:00 | 1 | ebd511983fe912346e316667a05c5cf9 | 6b90f847357d8981edd79a1eb1bf0acb | 2018-06-28 20:16:03 | 189.90 | 19.43 | 242111cdd392f2dfbcf3453b5cc83b3c | 1 | NaN | Produto n foi entregue e consta q recebi | 2018-07-04 00:00:00 | 2018-07-05 01:59:16 | consoles_games | 58.0 | 1278.0 | 1.0 | 200.0 | 20.0 | 11.0 | 16.0 |
| 112366 | e98998055b4804137a020830903f93f2 | e097d846931763789c3fd00b27f3c325 | 3057 | sao paulo | SP | 34041ba8694b6808060aafda57f6075e | delivered | 2017-03-20 18:33:07 | 2017-03-20 18:33:07 | 2017-03-21 15:07:41 | 2017-03-23 15:56:26 | 2017-04-06 00:00:00 | 1 | 1ce0e963805e3c170485768e8b09fe65 | 7722b1df1b0e383e000397b2c11e3e19 | 2017-03-24 18:33:07 | 39.90 | 11.74 | c6098e815f17530305cef4e09f25454b | 5 | NaN | Gostei muito do produto. | 2017-03-24 00:00:00 | 2017-03-25 11:03:30 | moveis_decoracao | 50.0 | 1342.0 | 3.0 | 800.0 | 25.0 | 25.0 | 25.0 |
| 112367 | f6c6d3e1e20969a5eed982163f959719 | fb354969e06f2093c0083cbfbb91864e | 1521 | sao paulo | SP | b2f58affcc178fea2daaf834f1acff5e | delivered | 2018-07-14 14:08:11 | 2018-07-17 04:31:33 | 2018-07-25 11:18:00 | 2018-07-26 19:18:32 | 2018-07-30 00:00:00 | 1 | 9682ad2500ae8b2609e6a88eb0cbc5bb | 0bf0150d5b9d60d9cd2906003332f085 | 2018-07-25 04:31:33 | 99.90 | 21.14 | 17e0e42ddf0dd7ebceff457b7c1da303 | 3 | NaN | NaN | 2018-07-27 00:00:00 | 2018-07-29 22:34:18 | casa_conforto | 32.0 | 373.0 | 1.0 | 1500.0 | 45.0 | 30.0 | 45.0 |
| 112368 | da37711b17efd5f2539e8196ab215f04 | 5f2971f9805e3ccb030226e30c8e8390 | 4313 | sao paulo | SP | c8203bb57639618630affac9e8e923dd | delivered | 2017-03-27 23:04:18 | 2017-03-27 23:23:38 | 2017-03-28 14:18:59 | 2017-03-30 17:06:19 | 2017-04-13 00:00:00 | 1 | aea06073397f809424f946979354c9f0 | f45122a9ab94eb4f3f8953578bc0c560 | 2017-04-02 23:23:38 | 19.99 | 10.96 | f3642bae4843d919c4c53ec1ff8fe26d | 4 | NaN | NaN | 2017-03-31 00:00:00 | 2017-04-04 01:37:20 | pet_shop | 43.0 | 779.0 | 1.0 | 300.0 | 16.0 | 16.0 | 16.0 |
| 112369 | 184e0c2cfc746789643521df0e9ff904 | c64ede6d0ae8901b1b6fb03528c1b7e6 | 68660 | sao miguel do guama | PA | 45b3000bcd10464ac178f32cd783fc83 | delivered | 2017-12-07 23:55:46 | 2017-12-09 23:50:32 | 2017-12-11 20:29:02 | 2018-01-05 17:59:38 | 2018-01-22 00:00:00 | 1 | bbf975bffd2ae9ee52f513ae5c8a4b27 | 04aba03279157f6d4e0fe8ccaf21963c | 2017-12-13 23:50:32 | 250.00 | 54.11 | 6dfe34036fd1f79225daeaca9cf083c2 | 4 | NaN | Tive um pouco de dificuldade pelo fato do número da residência está incorreto. Nos deu tudo certo. | 2018-01-06 00:00:00 | 2018-01-07 14:33:35 | beleza_saude | 40.0 | 1694.0 | 1.0 | 2400.0 | 33.0 | 10.0 | 16.0 |
| 112370 | 821a7275a08f32975caceff2e08ea262 | 046470763123d3d6364f89095b4e47ab | 5734 | sao paulo | SP | 49645a8902c1ba980836b7bff991d69f | delivered | 2018-04-04 17:50:52 | 2018-04-04 18:08:41 | 2018-04-05 16:36:49 | 2018-04-06 23:32:21 | 2018-04-18 00:00:00 | 1 | f6e0a9ce8a6e91c3a0ca2d3005911d20 | cab85505710c7cb9b720bceb52b01cee | 2018-04-10 18:08:41 | 84.90 | 7.46 | 260447daa5d738ced8f4e0bbee8a08a2 | 5 | NaN | NaN | 2018-04-07 00:00:00 | 2018-04-11 13:34:08 | fashion_bolsas_e_acessorios | 29.0 | 498.0 | 4.0 | 300.0 | 16.0 | 16.0 | 11.0 |
| 112371 | 1ed0c832c2dd99570a59260e71768bdf | 82d46759af0369aad49084bacf85a6c3 | 37610 | bom repouso | MG | 51c6d2f460589fa7b65f2da51e860206 | delivered | 2017-11-14 12:04:09 | 2017-11-14 12:15:25 | 2017-11-27 20:44:47 | 2017-12-19 19:37:33 | 2017-12-12 00:00:00 | 1 | c98bf47f7bea8f3aee82fa023786b8a1 | 51a04a8a6bdcb23deccc82b0b80742cf | 2017-11-24 12:15:25 | 167.99 | 31.93 | 603f2873842a6975a43c54d305397d69 | 1 | NaN | NaN | 2017-12-14 00:00:00 | 2017-12-16 13:50:11 | eletronicos | 33.0 | 63.0 | 1.0 | 6185.0 | 63.0 | 11.0 | 20.0 |
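
The timestamp columns in the sample rows are plain strings, which is why the report treats them as high-cardinality categoricals. A minimal sketch of parsing them and comparing the delivery date with the estimate, using three rows copied from the samples above, might look like this. The helper `delay_days` and the `samples` mapping are hypothetical names, and the format string assumes every timestamp follows the `YYYY-MM-DD HH:MM:SS` pattern seen in the samples.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S"  # pattern seen in the sample rows (assumed uniform)

# row index -> (order_delivered_customer_date, order_estimated_delivery_date)
samples = {
    0: ("2017-05-25 10:35:35", "2017-06-05 00:00:00"),
    7: ("2017-05-30 09:19:58", "2017-04-27 00:00:00"),
    112371: ("2017-12-19 19:37:33", "2017-12-12 00:00:00"),
}


def delay_days(delivered, estimated):
    """Days between estimate and delivery; positive means late."""
    delta = datetime.strptime(delivered, FMT) - datetime.strptime(estimated, FMT)
    return delta.total_seconds() / 86400


delays = {idx: delay_days(d, e) for idx, (d, e) in samples.items()}
for idx, days in delays.items():
    status = "late" if days > 0 else "on time"
    print(f"row {idx}: {days:+.1f} days ({status})")
```

Rows 7 and 112371 come out late, which matches the low review scores (2 and 1) recorded for those rows.
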